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A data-analysis strategy based on the maximum-likelihood method (MLM) is presented for 
the detection of gravitational waves from inspiraling compact binaries with a network of laser- 
interferometric detectors having arbitrary orientations and arbitrary locations around the globe. 
For simplicity, we restrict ourselves to the Newtonian inspiral waveform. However, the formalism we 
/— ^ ' develop here is also applicable to a waveform with post-Newtonian (PN) corrections. The Newtonian 

fvj , waveform depends on eight parameters: the distance r to the binary, the phase Sc of the waveform 

, ^ ' at the time of final coalescence, the polarization-ellipse angle tp, the angle of inclination e of the 

^ I binary orbit to the line of sight, the source-direction angles {6^, tj)}, the time of final coalescence tc at 

the fiducial detector, and the chirp time ^. All these parameters are relevant for a chirp search with 
multiple detectors, unlike the case of a single detector. The primary construct on which the MLM 
is based is the network likelihood ratio (LR). We obtain this ratio here. For the Newtonian inspiral 
waveform, the LR is a function of the eight signal-parameters. In the MLM-based detection strategy, 
the LR must be maximized over all of these parameters. Here, we show that it is possible to maxi- 
►^ , mize it analytically with respect to four of the eight parameters, namely, {r,5c,tp,e}. Maximization 

QQ ' over the time of arrival is handled most efficiently by using the Fast-Fourier- Transform algorithm, 

r~^ ' as in the case of a single detector. This not only allows us to scan the parameter space continu- 

("^ . ously over these five parameters but also cuts down substantially on the computational costs. The 

^\ • analytical maximization over the four parameters yields the optimal statistic on which the decision 

^^ I must be based. The value of the statistic also depends on the nature of the noises in the detectors. 

^^ ■ Here, we model these noises to be mainly Gaussian, stationary, and uncorrelated for every pair of 

^^ ' detectors. Instances of non-Gaussianity, as are present in detector outputs, can be accommodated 

^ , in our formalism by implementing vetoing techniques similar to those applied for single detectors. 

O^' Our formalism not only allows us to express the likelihood ratio for the network in a very simple 

;' ' and compact form, but also is at the basis of giving an elegant geometric interpretation to the de- 

l^i^, tection problem. Maximization of the LR over the remaining three parameters is handled as follows. 

Il" ' Owing to the arbitrary locations of the detectors in a network, the time of arrival of a signal at 

. 5^ I any detector will, in general, be difi'erent from those at the others and, consequently, will result in 

^^ . signal time-delays. For a given network, these time delays are determined by the source-direction 

$_( ' angles {0,0}. Therefore, to maximize the LR over the parameters {6^, 0} one needs to scan over 

^ , the possible time-delays allowed by a network. We opt for obtaining a bank of templates for the 

chirp time and the time-delays. This means that we construct a bank of templates over ^, 8, and 
0. We first discuss "idealized" networks with all the detectors having a common noise curve for 
simplicity. Such an exercise nevertheless yields useful estimates about computational costs, and also 
tests the formalism developed here. We then consider realistic cases of networks comprising of the 
LIGO and VIRGO detectors: These include two-detector networks, which pair up the two LIGOs or 
VIRGO with one of the LIGOs, and the three-detector network that includes VIRGO and both the 
LIGOs. For these networks we present the computational speed requirements, network sensitivities, 
and source-direction resolutions. 
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I. INTRODUCTION 

The existence of gravitational waves, which is predicted in the theory of general relativity, has long been verified 
'indirectly' through the observations of Hulse and Taylor [|l|. The inspiral of the members of the binary pulsar system 
named after them has been successfully accounted for in terms of the back-reaction due to the radiated gravitational 
waves [^||. However, detecting such waves with man-made 'antennas' has not been possible so far. Nevertheless, this 
problem has received a lot of attention this decade, especially, due to the arrival of laser-interferometric detectors, 
which are expected to have sensitivities close to that required for detecting such waves. 

A gravitational-wave (GW) source that is one of the most promising candidates for detection by Earth-based 
interferometric GW detectors is the inspiraling compact binary p|. Present estimates show a significant number of 
coalescence events every year of such binaries that produce waves strong enough to be detectable by current detectors 
during their inspiral phase, a few seconds before the onset of coalescence. Moreover, the time-evolution of these 
waveforms (chirps) is well understood in the frequency band where the present interferometric detectors are most 
sensitive. 

In the past, a sizable amount of research has been done on the problem of detecting gravitational waves using a 
single bar or interferometric detector. However, very little work has been devoted to develop techniques to analyze 
the data from a network of such detectors. Searching for chirps using a network of detectors is gaining importance 
due to (a) its superior sensitivity vis a vis that of a constituent detector and (b) improving feasibility for a real- 
time computational search. As has been argued before (see, e.g., Ref Q), for a given false-alarm probability, the 
threshold for detection is lowered as the number of detectors is increased. This increases the probability of detection 
by 'coherently' analyzing the signals from a network rather than a single detector. One can think of simpler approaches 
to the network problem where one matches event lists from different detectors in the network and sets up thresholds 
on the estimated parameter differences. Indeed, in the past, a formalism for interpreting coincidences of burst events 
in a pair of detectors has been suggested by Schutz and Tinto |5[|. However, such approaches, even if they were 
extended to the case of chirps, would be non-optimal, because they do not use the phase information of a signal at 
different detectors. The coherent search strategy described here crucially uses phase information. 

One of the early papers that came close to discussing the problem of detecting a Newtonian chirp using a network 
optimally was that of Finn and Chernoff |6) . This paper observed that since the orientations of the two LIGO detectors 
were very similar, their joint sensitivity was larger than any one of them. Bhawal and Dhurandhar also addressed the 
issue of detection using multiple detectors M] . Their main aim was to find the optimal recycling mode of operation of 
the planned laser interferometric detectors for which a meaningful coincidence detection of broadband signals could 
be performed. However, the issue of how a network of detectors with arbitrary orientations and arbitrary locations on 
the globe can be optimally used as a "single" detector of sensitivity higher than that of any of its subsets of individual 
detectors was not addressed in these earlier papers. 

Use of a detector network has nevertheless received considerable attention in the context of the parameter estimation 
problem. A formalism for using the responses of multiple detectors, in the absence of noise, to infer the parameters of a 
chirp (also known as the "inverse problem") was developed by Dhurandhar and Tinto |g,^. Some of the other notable 

12] . The prime motivation behind using a network 



er are the errors in estimated values of the binary 



works that address this issue in the presence of noise are Refs. [10- 

in this regard is that the larger the number of detectors, the small 

parameters. However, the starting point in these approaches is the assumption that the problem of detection has 

already been addressed and the detector-specific chirp-templates that result in "super-threshold" cross-correlations 

with the individual detector outputs, have been picked. 

A formalism was developed in Ref. p3[ that sought an optimal detection strategy for chirps in the simplifying case 
of a network with closely located laser-interferometric detectors and with idealized detector noise. This work was 
based on a coherent search. Its main result was that the optimal statistic for a network of up to three such detectors 
was proven to be the sum of the signal-to-noise (SNR) ratios of the individual detectors. It was also shown that the 
sensitivity of such a network improved as roughly the square-root of the number of detectors in the network. This 



formalism was extended to the case of arbitrarily located detectors in Ref. |14|, which showed a way to reduce the 
network statistic such that the number of chirp parameters over which a numerical search is required for detecting 
chirps drops from eight to three. This paves the way for a vast reduction in computational speed requirements and 
makes a multiple detector search for chirps much more feasible. One of the main aims of this paper is to formulate 
a data-analysis strategy that implements these formal findings in the case of existing and upcoming networks and to 
provide estimates on the required computational speeds, etc. 

As the members of a binary orbit around their center of mass, they lose energy in the form of gravitational waves. 
This results in their inspiral. Consequently, they emit gravitational waves with monotonically increasing amplitude 
and frequency p5[ . Although the gravitational waveform originating from an inspiraling binary is known accurately 
up to the 2.5 post- Newtonian (PN) order jl^l, nevertheless as a first calculation we limit our study to the detection of 



the Newtonian chirp. This is because our primary aim here is to develop the new formahsm, namely, that of optimally 
using the data from a network of detectors to detect a chirp. We find evidence of the applicability of our formalism to 
higher post-Newtonian orders. We also find for the Newtonian signal that there is essentially no correlation between 
the parameters describing the masses and the direction angles to the source when the noise curves are assumed to 
be identical for all the detectors in the network. This has the following important implications: The total number of 
templates is then just a product of number of templates for a single detector and the number of templates needed to 
scan source directions. If this property holds also for the PN case, then, in effect, we need to obtain the number of 
templates for the directions only and club together with this the information we have on the number of templates for 
a PN signal in a single detector. We hope to address this issue in detail in future work. 

In our analysis, we assume that the noise in each detector is predominantly stationary and Gaussian, with occasional 
contamination from non-Gaussian events. Indeed, the real data stream from the detectors is not expected to be purely 
stationary and Gaussian, unlike what is assumed in most of the GW data-analysis literature thus far. In fact, the data 
from the Caltech 40 meter prototype interferometer have the expected broadband noise spectrum, but superposed 
on this are several other noisy features |L7| , such as long-term sinusoidal disturbances arising from suspensions 
and electric-main harmonics, which have been studied in other works ||l^,|l9|. There are also transients occurring 
every few minutes, typically due to servo-controls instabilities or mechanical relaxation in suspension systems, etc. 
Adaptive methods are being explored to combat high amplitude ring-downs and sinusoids occurring in the data po| , 
by effectively removing them from the data, so that the data are 'cleaned' from these non-Gaussian features. In the 
improved detectors of the future, it is expected that the noise will tend to Gaussianity, and may only be occasionally 
contaminated by non-Gaussian events. It is in this spirit that our above assumption about the noise must be taken. 
The strategy we adopt in dealing with such a detector noise is to assume it to be stationary and Gaussian for the 
purposes of our statistical analyses. Such an assumption is justified in practice provided one vetoes out detections 
due to the occasional non-Gaussian events occurring in the data obtained from the detectors. The vetoing criterion 
we propose is an extension of the x^ criterion used in Ref. p7| ] . This model of the noise is simple enough so that 
analytical methods can be used for the approach we take. 

We also assume that noises to be uncorrelated among different pairs of detectors. When the detectors are widely 
separated around the globe, correlations among the noises of different detectors are expected to be negligible, and our 
assumption remains valid for such a case. For most networks of proposed detectors this is true, unless it consists of 
the two coincident detectors at Hanford.H In that case, a more general analysis is necessary that takes into account 
possible correlations between the noises in different detectors. Such an approach is being pursued by Finn |pi[| . The 
only other assumption we make on the detector noise is that it is additive. 

We use the maximum-likelihood method for optimizing the detection problem. The problem is formulated by 
obtaining a single likelihood-ratio (LR) for the entire network. We define the "network output", x, as a single 
network- vector, the components of which are the individual detector outputs. Similarly, the "network signal" , s, is 
defined to be a single network- vector, the components of which are the individual detector signals. Since we assume 
the noise in different detectors to be independent, the probability density function (PDF) for the noise of the network 
is just a product of the PDFs of noise in the individual detectors. The LR is then a simple expression in terms of the 
norm of s and the inner product of x and s. In this form, the LR is a function of a complete set of eight independent 
parameters that characterize the Newtonian chirp signal of an inspiraling binary. 

For the assumptions made on the detector noises, a network's logarithmic likelihood ratio (LLR) turns out to be 
the sum of the individual detector's LLRs. The form of LLR allows us to deduce the network matched-filter in a 
straightforward way; it turns out to be an M-dimensional vector with components that are just the normalized single 
detector templates of Sathyaprakash and Dhurandhar ||2^ (henceforth referred to as SD). Inferring this network- 
template is the second result of this paper. The problem of detection then reduces to the maximization of LLR over 
the parameters and comparing its maximized value with the pre-determined detection threshold. We argue that this 
step can be implemented in a way similar to the one suggested in SD. 

To obtain the maximum likelihood ratio (MLR), the LR has to be maximized over the eight parameters: the 
distance to the binary system r, the phase of the waveform at the time of coalescence Sc, the polarization-ellipse angle 
-0, the inclination of the binary orbit e, the direction angles {0, 0}, the time of final coalescence at the fiducial detector 
tc, and the chirp time ^. In principle, this can always be done numerically using a grid in the eight dimensional 
parameter space. In practice, such a strategy is not only computationally infeasible but, as we show in this paper, is 
also wasteful. The first important result in this paper is a new representation for the signal, which is expressed here 
in terms of the complex expansion coefhcients of the wave and the detector tensor in a basis of symmetric, trace-free 



^The Hanford site has two detectors with arm- lengths of 4 km and 2 km, respectively. 



(STF) tensors of rank 2. Such a representation of the signal not only allows us to express the LR for the network in 
a very simple and compact form, but also brings out the symmetries in the response functions of the detectors and is 
at the basis of giving a novel geometric interpretation to the detection problem. Maximization of the LR over four of 
the eight parameters can be performed analytically using the symmetries in the responses, which are clearly brought 
out when the responses are expressed in terms of the Gel'fand functions. Further, the Fast Fourier transform (FFT) 
can be used to maximize the LLR over the parameter t^, as in the case of a single detector. The network template is 
constructed by taking into account appropriate time-delays at the individual detector sites. 
The analytic maximization and the FFT have the following advantages: 

1. They allow us to scan continuously the parameter space for the five parameters r, 5c, e, ip and ic- 

2. They save substantially on the computational cost. 

We are then left with the three parameters, namely, i^, 9, and (p. A full sky search over (6', </>) maps to a time- 
delay window, consisting of all possible time-delays, for a given network. The search over the time-delay window 
may be performed by using the samples of the cross correlations between the signal and the detector outputs or by 
constructing a template bank. The latter approach has the advantage of incorporating the desired mismatch related 
to the fractional loss in signal-to-noise ratio. Turning the argument around, the template bank can also be used as 
guideline to re-sample the data at a rate that is consistent with the desired mismatch and then scan over all samples 
in the time-delay windows. We, therefore, opt to construct a template bank in ^, 9, and (j). This is efficiently obtained 
by first computing the metric as given by Owen p3|. We then obtain the volume of the parameter space, given the 
metric, and divide this volume by the volume spanned by a template to obtain the number of templates. From this 
information, we can easily evaluate the computational costs for the search. Secondly, the metric is essentially the 
Fisher information matrix and its inverse yields the covariance matrix from which errors in the parameters can be 
obtained. 

We apply our formalism to analyze several networks of detectors. First, we examine idealized networks, with all 
detectors having the LIGO-I noise. Such an exercise nevertheless yields useful estimates of computational costs while 
at the same time simplifying the calculations and providing invaluable insights. We then consider the LIGO- VIRGO 
network with their respective noise curves. The computations for this case are done numerically. For these networks 
we estimate the computational speed requirements, sensitivities, and the resolution in the direction to the source. 
We find that the computational costs are high even for the two-detector network. The online data processing speeds 
required are in terms of Gflops and for a 3 detector network the online speeds needed escalate to few tens of Tflops. 
The costs would go up further when FN corrections are incorporated into the waveform. For example, for LIGO-I 
noise and allowing a maximum mismatch of 3% between the signal and the template, the number of templates required 
increases by a factor of about 4 or 5 ||24| . Hence, even for a network searching for PN-corrected waveforms, one may 
expect the computational costs to increase by similar factor. Clearly, our results show that use of hierarchical search 
methods are called for. Assuming the individual masses to be greater than O.SM©, and with LIGO-I noises in the 
detectors at Hanford and Louisiana, the online computing speed requirement is 12 Gflops, for a 3% mismatch between 
the signal and the template. The corresponding figure for one of the LIGO detectors and VIRGO is 170 Gfiops. For 
the three detector LIGO- VIRGO network, the cost rises to few tens of Tflops. The sensitivity roughly scales as \/M 
or a little less, where M is the number of detectors. The resolution in direction is about a fraction of a degree for the 
networks where we have assumed LIGO-I noise and a signal-to-noise ratio of 12. 

The paper is organized as follows. We begin by setting up in Sec. ^the basic mathematical framework required for 
our formalism. This includes a discussion of the various relevant coordinate frames and their relationships with one 
other. We also introduce the wave and detector tensors, using which we define the signal at a detector. The signal 



at a detector is then used to define the network signal and infer the network statistic. In Sec. [II, we present the 
Newtonian chirp in its familiar form. This allows us to define the role of each parameter infiuencing the waveform. It 
also introduces important notations that we follow in the rest of the paper. We then derive a less familiar expression 
for the signal induced by a chirp in a detector. This new representation for the chirp-signal simplifies the analysis 
associated with a network-based detection strategy. In Sec. C^, we show how the detection problem can be optimally 
addressed using the maximum-likelihood method. We present a single likelihood ratio for the entire network. It has 
a very simple form owing to our use of the new representation for the signal. The LR is analytically maximized first 



with respect to r and 5c in the well established way |£2 13 1, and then with respect to "0 and e using the symmetry 
properties of the detector responses [H| . We then show how the FFT can be used to efficiently maximize over the time 
of final coalescence or, alternatively, the time of arrival at a fiducial detector. This is followed by a construction of the 
network template and the network correlation-vector. The window consisting of all possible time-delays is discussed. 
In Sec. ^, we construct the template bank on the rest of the parameter space, i.e., on {^, 9, (f)} by extending Owen's 
p3| method and present a way of arriving at the number of templates required. We then give expressions for the 



computational costs and the resolutions achievable in parameter values. Sec. VI is devoted to the discussion of several 



networks including the realistic network of LIGOs and VIRGO. In Sec. VII, we discuss the statistical properties of the 
optimal network statistic. We calculate the false-alarm and the detection probabilities associated with the network 
statistic, and obtain a relation between the network sensitivity and the number of detectors in a network. We also 
discuss the case where the detector noise is contaminated by non-Gaussian noise events and suggest a vetoing criterion 
based on the x^ - test. 

We use the following convention for symbols in this paper, unless otherwise specified. When it is useful to keep track 
of the complex nature of a network-based or individual detector-based variable we denote it by an uppercase Roman 
letter, whereas the lower case letters are reserved for real variables.^] Network-based vectors are displayed in the Sans 
Serif font. The label I in the superscript or subscript of a variable denotes a (real) natural number that associates it 
with a particular detector. It ranges from 1 to M, where M is the total number of detectors in a network. It can be 
considered as a vector index over detectors. We use the index I for several of the network variables. However, certain 
quantities that do not obviously display a vector character, but still pertain to a detector, /, are denoted by enclosing 
the index in parentheses. Einstein summation convention over repeated indices is used for brevity, unless explicitly 
stated. 

II. MATHEMATICAL FRAMEWORK 
A. Reference Frames 

Our first aim is to obtain a quantity that would define the response of an arbitrary network of broadband detectors to 
an incoming gravitational wave. In this quest, it is important to understand how the responses of arbitrarily oriented 
and arbitrarily located individual detectors to such a wave relate to one another. This is aided by introducing the 
three different frames of reference that arise naturally in such a problem, namely, (i) the wave frame, (ii) the network 
frame, and (iii) the frame of a representative detector in the network. We define these reference frames in terms of 
the following right-handed, orthogonal, three-dimensional Cartesian coordinates: 

(j) Wave frame: We associate with this frame the coordinates {X, Y, Z). The gravitational wave, which is assumed to 
be weak and planar, is taken to travel along the positive Z-direction; then X and Y denote the axes of the polarization 
ellipse of the wave. 

(a) Network frame: There is no unique definition of this frame. For Earth-based detectors being discussed here, if 
the network has a large number of detectors (say, M > 3), a convenient choice is a frame attached to the center of 
the Earth. Let the coordinate system that defines this frame be (xe, J/b, ze)- The xe axis lies along the line joining 
Earth's center and the equatorial point that lies on the meridian passing through Greenwich, England. It points 
radially outwards. The ze axis is chosen to lie in the direction of the line passing through the center of Earth and 
the north pole. The yE axis is chosen to form a right-handed coordinate system with the xe and ze axes. 

For a network consisting of M < 3 detectors, certain calculations can be simplified by using the fact that the corner 
stations (or hubs) of all the detectors will lie on a single plane. Specifically, for M = 3 we define the network frame 
such that one of the detectors is at its origin, a second detector is on one of the coordinate axes, say, z, and the third 
lies on one of the coordinate planes containing the z axis, say, the x — z plane. Later in the text when we consider 
various examples of three-detector networks, we choose this as the network frame. 

{iii) Detector frame: Let (x(/), 2/(/), Z(/)) (with / = 1,2,. ..,M) denote the orthogonal coordinate frame attached to the 
detector labeled /. The (x(/),2/(7)) plane contains the detector arms and is assumed to be tangent to the surface of 
the Earth. The a;(/) axis bisects the angle between the detector's arms. The direction of the yi-/) axis is chosen in such 
a way that (a;(7), ?/(/), 2(7)) form a right-handed coordinate system with the Z(7) axis pointing radially out of Earth's 
surface. 

Apart from the above choices for frames, we define a fourth frame, namely, the frame of a "fiducial" detector 
(henceforth referred to as the "fide"). This frame serves as a reference frame with respect to which the locations or 
orientations of each of the detectors in a network shall be specified. Indeed, we will develop our whole formalism for 
a general network using the fide frame as a reference. It is only towards the end, when we consider specific cases of 
networks, shall we identify the fide frame with one of the three frames defined above, depending upon suitability. 



^Note that quantities such as the gravitational constant, G, though written in upper case, are not complex since they do not 
represent any inherent characteristic of the network or an individual detector. On the other hand, we shall not use an uppercase 
letter to denote a complex quantity when its complex nature is apparent from other means, such as by the use of a tilde, e.g., 
in n, which denotes the, in general, complex Fourier transform of the real quantity n. 



Physical quantities in these frames are related by orthogonal transformations that rotate one frame into another. 
These orthogonal transformations are defined in terms of three sets of Euler angles that specify the orientation of one 
frame with respect to another. To understand these relations, let O be an orthogonal transformation matrix. Let 
((/i, 9, ijj) be the Euler angles through which one must rotate the fide frame to the align with the wave frame. Then 

Xwavc = C(0, ^,V')Xfido , (2.1) 

where Xfidc denote the axes of the fide frame and Xwavc, those of the wave frame. The transformation is an orthogonal 
matrix given by pq | 

(cos ■(/; cos (j) — cos 9 sin (j) sin t/j cos ijj sin 4> + cos 6 cos (j) sin ijj sin ^l> sin 9 \ 
— sin -0 cos (/) — cos ^ sin cos ■(/; — sin ?/) sin + cos S cos </> cos ?/) costps\n9 1 . (2.2) 

sin 9 sin — sin 6 cos (p cos 6 / 

Similarly, if («(/), /3(/),7(/)) are the Euler angles that rotate the fide frame to the frame of the /-th detector, then 

Xdctcctor(i) = C'(a(7),/3(/),7(/))Xfide , (2.3) 

where Xdctcctor/D denote the axes of the /-th detector frame. 

One can imagine yet another frame attached to the source whose z axis is along the orbital angular-momentum 
vector of the binary. The angle, e, between this vector and our line of sight to the binary is termed as the inclination 
angle and has the range [0, tt]. The associated x — y plane specifies the plane of the binary. It is then possible to orient 
the X and y axes on this plane in such a way that a rotation of the wave frame through the Euler angles, (0,e, 0), 
aligns it with the source frame .P] 

B. Wave tensor, detector tensor, and beam-pattern functions 

A gravitational wave can be represented by metric tensor fluctuation, /i^^, about the background space-time which 
we take to be fiat. The subscripts ^ and v denote space-time indices. In the transverse trace-free (TT) gauge, the 
non- vanishing components of /i^^ in the wave frame are h^x = ~hyy = ft.+ , hxy = hyx = hx- Here, /i_|_ and h^ are 
the two linear-polarization components of the wave. When a metric fiuctuation specifically represents a gravitational 
wave, its spatial part is identified as twice the wave tensor, Wij, where i and j refer to space indices and take values 
1, 2, and 3 (see Ref. |^). In the TT gauge, the wave tensor is a symmetric trace-free (STF) tensor of rank 2J^ In any 
arbitrary frame, the wave tensor can be expressed in terms of its circular-polarization components as, 

w'^t) = ^[{h+{t)+thx{t))4 + {h+{t)-thx{t))e'i] , (2.4) 

where e^ ^ are the right and left-circular polarization tensors, respectively. The e^ ^ are both second rank STF 
tensors and obey the orthonormality conditions, 

'^L,R^*L,Rij — ^ 1 ^L,R^*R,Lij—^ ; (2-5) 

where a star denotes complex conjugation. The reality of the wave tensor ensures that 

(2.6) 



Using (2^), the wave-tensor expression (2^) simplifies to 



^However, since we will be expressing the gravitational- wave metric fluctuations, hf_i^, in the transverse-traceless gauge (see 
below), in addition to this rotation we must also project /i^^ orthogonal to the direction of the wave in order to obtain 
its components in the wave frame. In that case the polarization-ellipse angle tp can be included as an Euler angle in the 
transformation 0{il),t, 0) of the wave frame to the source frame, instead of including it in the rotation from the fide frame to 
the wave frame, as is traditionally done. This observation will be used in Sec. |V|to obtain a reduced statistic for the detection 
of chirps. 

*See appendix M for properties of such tensors. 



w'^it)^^ {h+{t) + ih^{t))e'^ , (2.7) 



where di[A] denotes the real part of a complex quantity A. 

In an arbitrary reference frame, the polarization tensor can be expressed as 

e£ = m m-' , 

where the m*^ is the A:-th component of a complex null vector m in that reference frame. It is defined as 



(2.8) 



m^ = i=(e^+*e^) , (2.9) 

where ex and ey are unit vectors in the X and Y direction of the wave axes, respectively [^. The above expression 
shows that the e^ ^ are STF tensors of rank 2. Hence, they can be expanded in a basis of STF-2 tensors, 3^2ni (see 
appendix ^ : 

^ and et^^J^T,-yt , (2.10) 

where the expansion coefficients, T±2", with n = 0,±1,±2, are Gel'fand functions ||2^. (For a more elaborate 
discussion on these functions see Refs. IqM.) These functions depend on the Euler angles through which one must 
rotate the reference frame to the wave frame. If the reference frame is chosen to be the fide, then these angles are 
just {(f), 6, tp). While implementing more than a single frame-transformation in relating these two frames, the addition 



theorem (A4) for Gel'fand functions is used to obtain the required wave tensor. In such a case, the wave tensor 
depends on more than one set of Euler angles. 
The /-th detector tensor, d\, , is given by 

dlj = '^(/)li'^(/)lj - "-(/)2j"-(/)2i , (2-11) 

where 'nn)i and n//\2 are the unit vectors along the arms of the /-th interferometer, which is taken to have orthogonal 
arms.|j Like the polarization tensors, even d^ is an STF tensor of rank 2. Hence, in any frame it can be expanded 
in a basis of STF-2 tensors. In such a basis, the components of df can be expressed in terms of Gel'fand functions. 



T±2" (see Eq. (B3)). In the fide frame, these functions depend on the Euler angles {«(/), /9(/),7(/)}, which specify 
the relative orientation of the /-th detector with respect to the fide. 

When detectors are distributed around the globe there are, in general, relative delays in the arrival times of a 
particular phase of a given wave at different locations. Let r(/) (6*, </)) be the relative delay between the arrival times 
at the /-th detector and the fide, where the source direction is given by (6*, cj)). If r\{9, 0) is the unit vector along the 
direction of the wave, i.e., h{0, 4>) ~ Z, then 

M^M):n(^ , (2.12) 

c 

where r//) and rtf\ are the position vectors of the /-th detector and fide, respectively, with respect to any given 
reference frame. Note that T(i){0, (p) can take positive as well as negative values. 
The signal in the /-th detector is the scalar 

/(t) = ^»^(i-r(,))4 , (2.13) 

which, by definition, is invariant under coordinate transformations. Above, w*^ (t) is the wave tensor at the location of 
the fide at time t. It is a function of h-\-{t) and hx{t), which define the amplitudes of the two polarization components 
at time t and at the location of the fide. The above definition shows that the signal depends on the projections of the 
polarization tensors, e'l ^, onto the /-th detector tensor, df-. These projections are 

F^ = e^4, F'*=4di, , (2.14) 



^See appendix M for a more general expression. 



which are the beam-pattern functions for the left- and right-circular polarizations, respectively. They depend on the 
direction of the wave and the orientation of the detector. Owing to any motion of the source with respect to the 
detector this orientation can change with time. Hence, in general, F^ are functions of time. Since we will be concerned 
here with only short-duration signals, we will assume these functions to be independent of time (which is valid to a 
very good approximation) . Using the above definition of the beam-pattern functions and the wave-tensor expression 



( p. 71) in Eq. (|2.13| ), we find the signal to be 

.s'it)^^[{hl{t)+zhi{t))F'*] , (2.15) 

where h^_^_{t) = h+{t — ti^i)) and h\^ {t) = hy,{t — r(/)) are the time-delayed amplitudes of the two polarizations of the 
wave at detector /. 

C. Network signal and network statistic 

The signal from an inspiraling binary will typically not stand above the broadband noise of the interferometric 
detectors; the concept of an absolutely certain detection does not exist in such a case. Only probabilities can be 
assigned to the presence of an expected signal. In the absence of prior probabilities, such a situation demands a 
decision strategy that maximizes the detection probability for a given false alarm probability. This is termed as the 



Neyman-Pearson criterion |27|. Such a criterion implies that the decision must be based on a statistic called the 
likelihood ratio (LR). It is defined as the ratio of the probability that a signal is present in an observation to the 
probability that it is not. This is the criterion we employ in formulating our detection strategy. 

In order to define a strategy to search for signals in a noisy environment, it is important to recognize the char- 
acteristics of the noise. Here, we assume that the noise, n^ (t), in the I-th detector (a) has a zero mean and (b) is 
mostly stationaryM and statistically as well as algebraically independent of the noise in any other detector. These 
requirements are mathematically summarized, respectively, as: 



n^{t) = 0, (2.16a) 

WfWif) = Shii){f)S{f ~ nsj , (2.16b) 

where the over-bar on a symbol denotes the ensemble average of that quantity and the tilde denotes the Fourier 
transform, 

/•OO 

n\f)^ / n\t)e-^"f'dt. (2.17) 



Also, s/j(/) (/) is the one sided power-spectral-density (PSD) of the I-th detector. Note that s/,,(/) (/) is the Fourier 
transform of the auto-covariance of the noise in detector /. We also assume the noise to be additive. This implies 
that when a signal is present in the data, then x^ (t) is given by 

x'{t) ^s'{t)+n'{t) , (2.18) 

otherwise x^{t) = n^{t). 

As we shall see below, an important tool in the theory of detection of known signals in noisy environments is the 
cross correlation between a signal template and a detector's output. In order to define it, consider two real, sufficiently 
smooth, and absolutely integrable functions of time, namely, a{t) and b{t). For the purposes of this paper, we can 
assume that the signal template, s^{t), and the detector outputs, x^ {t), belong to this category of functions to a good 
approximation. A cross correlation can be represented in terms of an inner product, which is defined as 

{a,h)^i^=2^rdf~^^W) , (2.19) 

J a Sh(i){f) 

where a(/) and &(/) are the Fourier transforms of a{t) and 6(i), respectively. In order to obtain the correlation 
between a complex function A(t) = ai{t) + ia2(t) and a real function b{t), we adopt the following convention to define 
the inner product 



^In reality, detector noise contains non-Gaussian and non-stati onary components. To accommodate such features in our 



treatment, we use vetoing techniques, which are discussed in Sec. VII B 



{A,b) = {ai,b)-i{a2,b). (2.20) 



This definition is consistent with the convention of ( 2.19 ) where the complex conjugation is performed on the first 
entry in the inner product. 

For a network of M detectors, the data consist of M data trains, {x^ {t)\I — 1, 2, ..., M and t E [0, T]}. The network 
matched-template can be obtained naturally by the maximum-likelihood method, where the decision whether the 
signal is present or not is made by evaluating the likelihood ratio (LR) for the network |g^ . Under the assumptions 
made on the noise, the network LR, denoted by A, is just a product of the individual detector LRs. In addition, for 
Gaussian noise, the logarithmic likelihood ratio (LLR) for the network is just the sum of the LLRs of the individual 

M 

lnA = ^lnA(/) , (2.21) 
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1=1 



where f|13| 



In A(,) = (/,x^)(,)-i (/,/)(,). (2.22) 



2 
The network LLR takes a compact form in terms of the network inner-product, 

M 



(s,x)ArM/ = ^(/(t),a;^(i))(/) , (2.23) 

7=1 

where 

s{t)^{s\t),s\t),---,s^'{t)) (2.24) 

is the network template-vector, which comprises of individual detector-templates as its components, and 

x(f) = {x^{t),x^{t),---,x^'^{t)) (2.25) 

is the network data-vector. It can be shown by using the Schwarz inequality that the network template, s, defined 
above yields the maximum signal-to-noise (SNR) amongst all linear templates and, hence, is the matched template. 
As shown in Ref. |13|] , in terms of the above definitions, the network LLR takes the following simple form: 

InA = (s,x)jvM' - ^(s, s)jvw , (2.26) 

which is a function of the source parameters that determine s. Given s, different selections of source-parameter 
values and, therefore, different values of s result in varying magnitudes of the LLR. The selection that gives the 
maximum value stands the best chance for beating the pre-set threshold on the LLR. Since scanning the complete 
source-parameter manifold for the maximum of LLR is computationally very expensive, we propose to perform its 
maximization analytically over as many parameters as possible. This requires the knowledge of the analytic dependence 
of the network matched-template on source parameters. This is what we seek below. 

III. THE SIGNAL 

Assume that the binary is at a luminosity distance of r from the Earth[|. Further, let mi and 7TI2 be the masses of the 
individual stars. Then, in the Newtonian approximation the two corresponding GW linear-polarization components 
in the wave frame, at the location of the fide, are 



''Here, r is not to be confused with the magnitude of a detector position- vector, which always carries as an index the label of 
the detector, i.e., (/) or (/). 



2Af _i 



1 + cos e 



COs[x{t]tc,C)+^c 



r z, 

r 



where 



AA; 



2G5/3X5/3(7r/,)2/3 



(3.1a) 
(3.1b) 

(3.2) 



is a constant appearing in the chirp ampHtudc having the dimensions of length. It depends on the binary's 'chirp' 
mass, M. = (r?zim2)^/^/(mi+m2)^/^, and a fiducial chirp frequency, fs- Usually, /s is taken to be the lowest frequency 
in the bandwidth of a detector - the seismic cut-off - hence the reason for the subscript s. This choice of the fiducial 
frequency maximizes the duration of tracking the chirp because the chirp frequency increases monotonically with time. 
Here we set fs — 40 Hz, which is the seismic cut-off for LIGO-I, because every network we consider below has at least 
one detector with LIGO-I noise. Note that a general network might include several detectors with different seismic 
cut-offs, fs(i)- Even in such a case, it is convenient to use the fiducial frequency as a reference. This is apparent in 
Eq. (C5) where the noise moments for different detectors are merely scaled by appropriate powers of (/s(/)//s). 
A quantity closely related to the chirp mass is the so-called chirp time. 



^ = 34.5 



M 

Mq 



-5/3 



fs 



40 Hz 



-8/3 



sec. 



(3.3) 



which equals the time duration for which the chirp exists in a detector's sensitivity window from the time of arrival 
until the time of final coalescence. The time of arrival, ta, is defined as the time when the instantaneous frequency of 
the chirp equals the fiducial frequency, i.e., f{ta) = fs- Formally, the coalescence time, tc, is the time at which the 
chirp frequency diverges (see Eq. (|3.5|)). The corresponding phase of the wave- form at tc is 5c- We define the quantity 



a{t\tc,0 



tc~t 



(3.4) 



and the instantaneous frequency. 



fit;fs,tc,0^fsa-'/'{t;tc,0^fs 



tc-t 



-3/8 



which diverges at final coalescence. The above expression also shows that 



ic l^n 



e. 



Finally, the instantaneous phase of the waveform is x{t) + ^c, where 



xit;fs,tc,0 



f{t'-Js.tc,Odt' = --iTfsia^"{t-tc,S,) ■ 



(3.5) 



(3.6) 



(3.7) 



The t wo GW po larizati on amplitudes at the /-th detector site are obtained by substituting t with (t — t/j\) in Eqs. 
(|j1), dU), (Q, and 1^. 

A chirp signal registers itself in a detector's output only after its instantaneous frequency crosses the seismic cut-off 
of that detector. Thus, a signal arrives in the /-th detector's bandwidth when its instantaneous frequency reaches 

/ = fs{i) and it lasts there for a time duration equaling ^(7) = ^ {fs{i)/fs) - Alternatively put, the chirp waveform 
at the /-th detector starts at t = tc + Vi) — C(i) and ends at t = tc + '''(i)- 

In order to formulate a strategy for detecting a chirp, it helps to isolate the factors in the two polarizations, /i+.x, 
that are time dependent from those that are not. To this end, we define two mutually orthogonal normalized waveforms 



4 and 4/2' with 5^,^/2(0 
signal - as 



So, 7r/2(^ — ■'■(/)) J and their complex combination S^ — Sg 






^■Kji 



the normalized complex 



(3.8) 



Here, 



5(/) 



is a normalization factor such that 
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(5^5^)(/) = l 



(3.9) 



We now obtain an expression for the normalization factor, (?(/)■ In the stationary- phase approximation (SPA), the 
Fourier transform of S^ (t) for positive frequencies is. 



/oo 
-OO 



2 / 2 / / ^ "^/^ 



5(7) V •J/s \/s 



(3.10) 



where 



= *(/;/.,ic, 0-2^/^(7) , 






-5/3' 



4 



(3.11) 



for the Newtonian chirp. Note that ^(/) = ^ for vanishing time-delay (r(/) = 0). Thus, ^ de fines the phase in the 
FT of the normalized complex signal at the fide, in the SPA. The normalization condition (3.9) implies that. 



2 _8 4/3 f°° df 



(3.12) 



where /^(^^ is the seismic cut-off for the /-th detector. 



A. The signal at a detector 

The signal due to a Newtonian chirp at the /-th detector can be expressed in terms of s^ {t). It is obtained by the 
chirp-specific components h^_^_ ^ from (3.1). We now express the GW circular-polarization components, {h^_^_ + ih\^), 
in terms of the normalized complex signal 5*^ and the overall amplitude n. In the special case of the face-on binary 
(i.e., e = 0), the signal at the detector is given by 



s\t)^2K-&[g(i)F^S\t)e" 



(3.13) 



where k = My/^/r. Note that 5c is detector independent and separates out as a phase factor in the expression for the 
complexified s^{t). 



The generalization of ( 3.13 ) for arbitrary value of e is straightforward. In this case, s^ can be expressed as follows: 

s'{t) = 2K^[{E*jS')e'^-] , (3.14) 

where we have defined the extended beam-pattern functions 

^ 1 -I- cos^ e „ 



E' - 3(7) 



'-di{F')+i cos e^{F^) 



(3.15) 



Here, 5R(F^) and 3(F^) are th e rea l and imaginary part s of t he detector beam pattern functions, respectively. In the 
limit e ^ 0, the signal in Eq. ( ^.14| ) reduces to that in ( 3.13 ). In terms of the Gel'fand functions, we have 



E' =g^j^T2P{^,e,0)Dl, p = ±2 



where, for a detector with orthogonal arms (see Eq. (B4)), 

Dl = -iT/(0, 9, 0) (t2% («(,),%), 7(,)) - T-2% («(,),/?(,), ^(,))) , 



(3.16) 



(3.17) 



which obeys D^* = DL„. Thus, E^ depends on the source-direction angles, {6*, 0}, the angles, {e, -0}, as well as on the 
orientation of the /-th detector relative to the fide, given by the Euler angles (a(/), /?(/), 7(/)). Also, E^ depends on 
the signal-normalization factor g(/) , which expresses the sensitivity of the detector to the incoming signal. As we find 
in the next section, the fact that the dependence of E^ on {e,'0} factors out in each summand in (3.16|) will turn out 
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to be a useful property in obtaining the optimal statistic for the detection problem. Thus, the signal at the detector 
depends on a total of eight independent parameters, viz., {r, Sc, tp, e, ^c, C, ^, (t>}- The ranges of the four angles are as 



follows: V' € [ 0,27r 



ee [0,7r] (/)£ [0,27r], and 61 G [0,7r]. 



From Eqs. ( 3.16 ) and ( 3.17 ), it is clear that E^ can be resolved into various factors (using the addition theorem ( A4) 
for Gel'fand functions). One may interpret E^ as the combined amplitude gain of the source and the I-th detector. 
As was shown in Ref. ||l^, up to an r-dependent factor, \E^\'^ can be interpreted as the total power transferred to the 
I-th. detector.^ More appropriately, it is the gain factor associated with the I-th detector. It can be decomposed into 
a sum of the fractions of power transferred from a signal to the detector by each of its two polarizations 



Expression (3.14) shows that the contribution of the extended beam-pattern function, E^ , to the signal, s^, comes 
from its magnitude as well as its phase. In the case of a single detector, these contributions cannot be separated from 
the overall amplitude and the "effective" initial phase of the signal. Thus, one cannot obtain precise information, even 
in the absence of any kind of noise, about the parameters {e, ip, 9, </>}, which affect the signal only through E^ . As a 
result, for data analysis involving a single detector, it is more meaningful to resolve E^ and express the signal in the 
form s{t) = git) cos(x(i) + ^) where q and w are overall amplitude and the effective initial phase of the signal, which 
get contributions from E. This is precisely what was done by SD. While using a network with multiple detectors such 
a degeneracy in parameters can be broken. Indeed, information about source direction, i.e., (0, (/>), can be obtained 
from the time-delays t/]\{9,(J)), by using the triangulation method. More pertinently, even when all the detectors 
are coincident in a network, one can use a set of independently oriented detectors to recover information about the 
different _E^'s and, consequently, about the parameters {e,^p,9,(j)} ||§,|9|]. Hence, in data analysis with a network it is 
crucia l to track the effect of E^ on the signal explicitly. This is where we shall find the form of the signal given in 



(E 



14) useful in rest of the paper. 



B. Network signal normalization 

The total energy in a signal that is accessible to a network is just the network scalar {s,s)nw, and is given by 

(s,s)w = ^(s^(i),/(t))(7) 
1=1 

M 

= Ak^^E*iE^ ^h^. (3.18) 

1=1 

The quantity J2f=iE*iE' = E • E =|| E Ip is the C^ norm of E' in C*^. To understand the significance of ||^||2 
consider a network comprising of detectors with identical noise PSDs and, therefore, identical 5(/)- For simplicity, 
let 5(7) = 1 for all /. Then, for a given set of values for {ip, e, (p, 9}, ||-E|p is a pure number. It defines the factor by 
which the energy accessible to such a network is larger or smaller than the maximum energy that is accessible from 
an identical source, but with ■;/; = = e, to a (favorably oriented) single detector. This maximum energy is 4k^. 
Therefore, b^ represents the total energy in the signal that is accessible to a network. It is just the sum of the signal 
energies accessible to each individual detector in the network. The quantity b can be regarded as the signal strength 
accessible to a network and has the following properties: 

(i) If the detectors have identical noise PSDs and are oriented identically, then we have || E |poc M and, therefore, 
the strength obeys b oc vM. This clearly shows that for a given k, a network of detectors can probe deeper than a 
single detector, by a distance that is -s/M times larger. 

(ii) If the detectors are oriented identically but have different noise PSDs, then the amount of energy accessible to 
each detector is proportional to the optimal SNR of that detector, namely, g(/). The detector having maximum SNR 
will contribute the most in terms the energy accessible to the entire network. 

(Hi) If the detectors have different orientations but identical noise PSDs, then the amount of energy accessible 
to each detector is proportional to the modulus square of the extended antenna-pattern function of the individual 
detector. In Fig. H, for a network of two detectors, b^ is plotted as a function of 9 and cj) (for e = and ip = 0). 

Another important quantity of physical interest is the signal power averaged over all the directions and the orien- 
tations of the source, i.e., 9, (p, e, and f/i. 



'in Ref. |I3|, the symbol VK(/) denotes a quantity analogous to E^ . 
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(b^ 



sin ede / dip 







sin 0d6l / d0 b^ 

"'0 



16^2 y^ 

This power is clearly independent of the orientations of the detectors. A detailed calculation shows that 



'2 \ ^ *^ 

' 1=1 



(3.19) 



(3.20) 



The factors (;(/) are then important in deciding the average signal strength. 

FIG. 1. Plots of the quantity || E |p for a two-detector network as a function of the direction to the source, with e = and 
-0 = 0. In (a), the two detectors are identically oriented with arms lying in the X-Y plane, but have different noise PSDs, say 
(/(I) — 1 and 3(2) — 0.25. Whereas in (b), the two detectors have identical noise PSDs, i.e., (7(i) = (?(2) = 1, but have different 
orientations, say, a(i) — /3(i) — 7(1) = and a(2) — 7(2) = 0, /3(2) — 90°, with respect to a fide frame. 
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The above analysis also suggests the normalization for the network signal. The signal vector with unit norm is 
defined by s = s/b. Its components are 



s^ = 3? [{Qj*S')e''^] 



where 



/ _ 



Q'^ 



E' 



(3.21) 



(3.22) 



Note that the network vector Q = {Q ,Q , ...., Q ) lies in C and has a unit norm, i.e., || Q || ~ 1. 

IV. MAXIMIZING THE LLR 

In the case of a single detector, the LLR is a functional of the data as measured by that detector. For a network of 
M detectors, one needs to compute the statistic in terms of the network data-vector x. When our assumptions about 
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the statistical properties of detector noise are valid, the appropriate network LLR is given by Eq. ( 2.26| ). The optimal 



network statistic is obtained by maximizing this LLR over the eight physical parameters that define the signal. It is 
this maximized LLR that must be compared with a predetermined threshold, corresponding to a given false alarm 
probability. In the following two subsections, we show how such a maximization over four of the parameters can be 
performed analytically. Subsequently, we describe an efhcient way of maximizing over the time of final coalescence 
or, analogously, over the time of arrival (at the fide) of the signal. 

A. Maximizing the LLR over b and Sc 

We begin by analytically maximizing the network LLR with r espec t to two parameters that are simplest to handle, 



namely, r and Sc- Note that the network LLR obtained in Eq. ( 2.26| ) can be expressed as an explicit function of b: 

M 



lnA = bf](s^x^)(,)-ib2. (4.1) 



2 

7=1 

Above, the luminosity distance, r, appears only through b. Maximizing In A with respect to b gives 

M 



h = J2{s',x')(^j) , (4.2) 



1=1 



where a hat denotes the value of a variable at which the LLR is a maximum as a function of that variable, keeping 
all other variables fixed. Here, the value of LLR at b = b is 



l'^^lb = Kp^''^'>«) 



= ^(5R[e-^^(C.Q)])' , (4.3) 

where we have defined, 

C;=c^~^4/2^(5^x^)(,), (4.4) 

with Cq = {s^,x^)/n and c^,2 — ('*W2' ^^)(^)- ^^ ^^ ^ complex quantity that combines the correlations of the two 



quadratures of the normalized template with the data. We proceed further and maximize the LLR in (L3) with 
respect to Sc- This yields. 



<5e = arg(C-Q) , (4.5) 



and the LLR maximized over b and Sc as. 



|2 



liiAb^--|C.Qr. (4.6) 



Now the maximized LLR is a function of six parameters, namely, {e, ip, tc, S., 9, </)}. 

When all the detectors are "closely" located or coincident, it is only the Q^'s that depend on four angles {9, (j), e, tp}; 
the C^ then depend only on {ic,T(/),^}, with all the times of arrival being equal. We refer to this situation as the 
"same-site" approximation. Such a case was dealt with in Ref. iQ. When the detectors are spatially well separated 
or non-coincident, the C^ depend on {9, 0} as well. In such a case, maximization over the remaining parameters is 
not as simple as in the same-site approximation. However, in Ref. [|l4| it was found that analytic maximization over 
two of the angular variables, {ip, e}, is possible even in the case of non-coincident detectors. This useful observation 
allowed further reduction of the LLR to obtain a network statistic. In the following, we briefly mention this analytic 
maximization before discussing how the reduced statistic can be used for searching chirps. 
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B. Further maximization over e and tp 

For spatially separated detectors the cross-correlation, C^ , is strongly dependent on the time of final coalescence, 
tc, and the time delay, Tj-/). Since r(7) depends on the source-direction, (0, </>), so does C^. This prompts us to recast 
the statistic, ([4.6|), in such a way that its dependence on the angles {ip,e} is isolated as sh own b elow . This aids in 
the analytic maximization of InAj^^- over the angles {e,V'}- We note that using Eqs. ( 3.16| ) and ( ^.22 ), the network 
vector Q can be re-expressed as 

Q = ^ [T2-H^, e, 0)D_2 + n^ii^, e, 0)D+2] 

= Q-2D_2 + Q+'D+2 , (4.7) 

where Dp [p = ±2) define two network vectors with the components g(j\D^, respectively. The vectors Dp are their 
normalized counterparts. Since Q has a unit norm, the above expression implies that 

g-' = [^r2-2(^,e,0), Q+^ = \^T^{i,,e,0) , (4.8) 

where || D || ^ || D+2 || = || D_2 ||. 

The following relations hold among Q^^, Q±2, and D±2: 



g+2 = D+2 • Q = 


= Q+' + Q-'D+2-D_2 


g_2 = D_2Q- 


.g+2D_2-D+2 + g-2 



(4.9) 

The pair {D+2, D-2} defines a two-dimensional complex subspace in C^^, on which a metric Gpq can be defined. D+2 
and D_2 depend only on the direction of the source and the orientation of the detectors, that is, oii{9,(j), a/j\ , (3/j\ , jtj\ } 
and not on {e,ip}. The metric components in the D+2 basis are given by, 

1 D+2 ■D-2 

D-2 -0+2 1 



Gp,= U /n ^ ^\' • (4-10) 



This metric can be used to 'raise' and 'lower' indices of vectors lying in this complex subspace, e.g., one has Qp — 
GpqQ"^, where p, q — ±2. We observe that, in general, D+2 are not orthogonal. For a face-on binary (i.e., e = 0) 
revolving anti-clockwise (or clockwise). Dp itself is proportional to the network vector E for p = —2 (or for p — 2). 
Hence, we may call the two-dimensional subspace as the "helicity" plane, Ti. 

The M-dimensional complex correlation vector C, in general, lies outside H. However, Q lies totally in H. Thus, 
the statistic reduces to 

2\nX\^j^^\C-Qf^\Cn-Q\' , (4.11) 

where C-h is the projection of C on H. Maximization of the above statistic over {e, tp} is achieved by aligning Q along 
Ci-i by a proper choice of ip and e. To show that this is always possible, we expand C-^ in the two-dimensional basis 
(D+2, D-2): 

C«-C^2D_2 + Ci'D+2 , (4.12) 

where C^ = GP'(Dq • C). Then the maximization condition is 



)-|-2 ^+2 

' - « (4.13) 



Q-' C^ 



2 



In principle, C can take any value in C^'^ , so the RHS of (4.13) can take any value in the complex plane. The task is 
to prove that the ratio Q~^'^/Q~^ obeys the same property. In other words, we must show that the ratio Q+^/Q"^ 
too can span the entire complex plane. This follows immediately from the observation that 

Q+2 r2-2(^,e,0) /1-coseV ,^.,, ,^^^, 
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For e € [0, n] and ip € [0, 27r], the RHS of (4.14) can indeed attain any value in the Argand plane. One set[| of values 
of f/' and e that maximizes the statistic are 

/■\ fi _ , /\7\' 

(4.15) 



7 arg(C) „ _i 

w = — : — , e = cos 

^ ^ ■ '\c\_ 

where C = C^ /C^-h ■ Thus, the LLR maximized over the four parameters is 

21nA|b,^^,^,, = ||CH||'^A. (4.16) 

Geometrically, we summarize the above maximization over the angles {e, ip} as follows: Choosing a given source 
direction fixes the orientation of the helicity plane in the network space. After making this choice, one projects the 
correlation vector C onto this plane. The vector Q inevitably lies in this plane. Thus, the values of {e, f/;} that 
maximize the statistic are those that align the vector Q along the projected vector C-h. 

It is always possible to choose in Ti, a two-dimensional basis comprising of a pair of orthonormal real vectors. In 
such a basis the components of any vector in Ti. will, in general, be complex. For the sake of concreteness, we define 
one such basis, (v+jV"), in the following way. We split 0+2 hito its real and imaginary parts 

D+2 = di+id2 , (4.17) 

where di and d2 are real vectors. We then define 

V± = (di±d2)/||di±d2|l . (4.18) 

Taking the projection of C on v±, i.e., C^ = v^ • C = Cq + ic^/j, we re-express Eq. ( 4.1(]| ) as 



II Cn\f = \C+f + \C-f = (c+)2 + (c+^^r + {c^f + {c-„f 

= 1? . (4.19) 

It can be verified that the statistic is, therefore, a sum of the squares of four Gaussian random variables with con stant 



variance. With an appropriate choice of normalization, these variances can be made unity. As we show in Sec. VII, 
this simplifies the computation of detection thresholds and probabilities associated with the above statistic. Instead 
of using the squared norm of C-j^, we will find it convenient to use as our statistic L =|| C-h ||, in what follows. We 
note that L then scales linearly with the amplitude of the signal vector, rather than its square. We will call L as our 
network statistic. This statistic was first obtained in Ref. ||l4| . 

C. Maximizing L over the time of arrival 

Given a network data- vector, x(i), which may or may not contain a chirp, it is necessary to first compute the 
correlation vector, C, before one can obtain C-^ and, therefore, the network statistic. In order to do so, we need C^, 



for all /. We compute a C^ (or, rather, C|) by first calculating the Fourier transform of the cross-correlation (4.4) 
for an individual detector by using FFTs. Taking its inverse FFT then gives us the C}{t) at all the time lags, r, in 
a cost effective way. Thus we get, 

C*j{T;t'^,e,9\<j>') = {S'{t;t'^ + T,e),x'{t;t,,0)ii) , (4-20) 

where the primed parameters define the detector template. If the values chosen for {^', 9' , (f>'} match those of a chirp 
in the data, then |C^| is likely to peak when r exactly compensates for the difference (i^ — ic)- Indeed, the correct C 
for a network of coincident detectors is just 

C{r;'d')^{C\T;^')} , (4.21) 

where i9 is the four-dimensional template parameter- vector. Also, / takes values from 1 to M. 



^There exist three other values of tp. 
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Construction of C for a network of non-coincident detectors is somewhat more involved owing to non-vanishing 
time-delays, T(/)(0, (/>), that may arise for a given source direction. Recall that the time of arrival at the /-th detector 
is ta + T^i){0,4>). If the detectors are spread around the globe, the times of arrival at any pair of detectors can at 
most differ by 2R(^/c ^ 40 ms, where i?® is the radius of the Earth. For the two LIGO detectors, the maximum time 
difference is ~ 10 ms; for the network of LIGO- VIRGO, it is ^ 27 ms. We note that T(/)(6', (/>) can take positive as 
well as negative values. Its range depends on the location of the fide. If the fide is chosen at the center of the Earth, 
then |t(/)| < R^/c; but if it is chosen to be one of the detectors on the surface of the Earth, then |t(/)| < 2R^/c. 
This contingency is dealt with by using the appropriate set of T(/)'s in Eq. (|3.8|) to obtain the S^{t; t'^, ^'). With this 



in place, the network correlation-vector is given by the same expression as in Eq. (4.21). 

One can obtain the same value for C by an alternative construction, which may be simpler to implement in practice. 
In this method, one first obtains the S^ by setting the arrival time at every detector to equal that at the fide. This is 



the same as computing the S^{t + T(^jy,t'^, £,'), for all /. (Note from Eq. (3^) that, despite appearances, a knowledge of 



T(/)'s is actually not needed for this computation.) With these templates one constructs the following inner products: 

Cnr;t[,a = {S'{t + T^jy,t'^ + T,e),x'{t;t,,0)(i) , (4-22) 

which are independent of the time delays. Indeed, a choice of the time delays is not made thus far in this alternative 
construction of C. To construct the network correlation-vector from the above quantities, one begins by choosing a 
source direction, {6' , 0'}, for the template. This direction is used to compute a consistent set of time delays, T(^i^{6' , 0'), 
for a given network. The network correlation-vector can then be defined as 

C(r;,9') = {^'(T + r(,)(0',0');^c,n} , (4-23) 

where the appropriately shifted time lags for each value of / compensates for the time-delay at each detector. In 
the rest of this section, we discuss the construction of network templates in greater detail. There, for a network of 
non-coincident detectors, we choose this latter prescription for such a construction. 

Network template construction 

Based on the above discussion, we construct a network template as follows. Given a chirp, consider the detector 
with the least seismic cut-off frequency. Label that detector as / = 1. The other detectors in the network are labeled 



such that /^(i) < fg(^2) < • • ■ fs(M)- Then from Sec. |I| we have ^(i) > ^(2) > ■ ■ ■S,{m)- Now, consider the signal in 
the first detector, I ~ 1. It lasts ^(1) seconds. As in the case of a single-detector "network", an individual detector 
template, which is an array of numbers, is constructed to be much longer than the signal: It comprises of a sub-array 
that stores the signal being searched for, followed by a padding of the requisite number of zeros ^^. In the single- 
detector case, it has been shown that a padding factor of 75%, that is, 75% zeros and 25% signal, is a good choice in 
the sense that it optimizes the computational cost arising from the computation of the FFT's. Accordingly, here too 
we pad the template for the first detector with zeros for a time duration of ^ 3^(i) . 

In the sensitivity window of any of the other detectors, that is, for (/ 7^ 1), the signal effectively lasts for a time- 
duration equal to or shorter than ^riy Nevertheless, the simplest way to construct their templates is to let them 
contain the same signal as in the first detector, and for the same duration, namely, ^(1). Such a construction does 
not restrict the network statistic in any way. Its only pertinent implication is that any part of the signal that has 
an instantaneous frequency below fsin will be ineffective in contributing to the SNR in the /-th detector, which 
conforms with what we expect. In the case of a network with coincident detectors, the indi vidua l detector-templates 



so constructed define the components of the network template-vector. Using it in Eq. (4.21) yields the relevant 
network correlation-vector. 

Obtaining the template-vector of a network with non-coincident detectors is trickier. This is essentially due to the 
possibility of the occurrence in a given detector of negative time-delays. We deal with this possibility by splitting the 
padding into two parts of durations Td and (3^(i) — r^), respectively. Since the maximum magnitude that a time-delay 
can have is less than 50 ms, a choice of t^ = 50 ms satisfies all requirements at negligible cost. This is the value we 
assume for Td in our simulations. Thus, the template of any one of the detectors in such a network is an array of 
numbers that begins with a "pre-padding" (with zeros) of duration Td preceding the signal of interval ^(1), which in 
turn is followed by a "post-padding" of duration (3^(i) — Td). The network template is then the Cartesian product 
of all these individual templates. Note that this can be taken to define the network template for the most general 
network, regardless of whether the detectors in it are coincident or not. For non -coin cident detectors, the relative 



time-delays are accounted for in the construction of the network statistic via Eq. (4.23). 

In order to construct the network correlation-vector, C, one utilizes the above templates as follows. Using these 
individual-detector templates one first obtains the correlations C\T]t'^,^') for all values of r (after setting t'^ = Q or, 
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equivalently, t'^ = £,'j without any loss of generality) by using the FFT algorithm, such as in the single-detector case. 
Note that the range of r, for a data train of length T, is r^ < r < T — ^(i) — r^. Next one selects a source direction, 
(6*', 4>'). One evaluates the time-delays, T(^i){9', (/)'), corresponding to this direction.QJ These time delays are then used 
in Eq. (4.23|) to compute C, which in turn is projected on Ti (defined by the sam e selection of {9' , cj)'}) in order to 



obtain C- h. T he network statistic can be easily recovered from this using Eq. (4.16). 

In Fig. [lO| (a), we show a network template for a network of two detectors. In all the panels, the dots represent the 
padding (with zeros), which is introduced before and after the signal. The two panels in (a) depict the two individual 
detector-templates constituting the network template. These two panels correspond to detectors with different seismic 
cut-offs, viz., fgi^i-f — 33 Hz and /s(2) = 40 Hz, respectively. The padding before (i.e., on the left-hand side of) the 
signal is of a duration t^ = 50 ms. The part of the curve for detector 2 that is shown in dots and dashes is ineffective 
in contributing to the SNR. Panels (b) show the relative positions of the signal in the individual detector-templates 
for which ||C|| has a maximum when the second detector has a relative time-delay of (r(2) — '''(i)) = 20 ms. Here, we 
have included the time delay in the network template. 

Above, the chirp time in the detector with the least fs{i) decided the durations of the padding and the chirp-signal 
in all the individual detector-templates. Indeed, these durations are the same in all of them. It may be possible to 
optimize on computational costs by varying these durations in different templates. However, in this work we do not 
pursue this point any further. 

D. Maximization of L over ^, 6, and <j!>. 

Consider the network correlation-vector, C(t), for a fixed value of ^', but for a range of values for r. As remarked 
before, such a vector is constructed for specific values of r and source direction {9' , cj)') by taking into account the time 
delays, tij\{9' , cj)'), appropriate for the network under consideration. The network statistic for these chosen values 
of {9' , (j}') and r can be obtained by first projecting C on the hclicity plane and then computing the norm of the 
projection. A chirp search over {0',^'} for a given configuration of network leads to a "window" of time delays. In 
any network of non-coincident detectors a window, W ^ is a bounded region in the space of time-delays that arises from 
the restrictions on each of the time delays to lie within certain limits. As we illustrate below, these limits, in turn, 
originate owing to the maximum light-travel time between pairs of detectors. Since the data are sampled discretely, 
a window is a bounded region of a lattice in the space of possible time-delays. Consequently, the number of (lattice) 
points in a window of finite "volume" is also finite. We now discuss the shape and size of a window for networks with 
two, three, four, and more than four detectors. 

• Two-detector network. 

For a network of two detectors, there exists effectively a single time-delay function of significance. It is the 
difference in the times of arrival of the wave at the two detectors. For such a network we choose one of the 
detectors, say, 1 = 1, as the fide. Then rjij =0 and the time delay between the two detectors is reflected solely 
in T(2), which is restricted to lie within the range [— (ii2/c, di2/c], where di2 is the distance between the two 
detectors. Let A be the sampling interval, which is typically 0.5 ms. Then the "width" of the window, expressed 
in terms of the number of time-sampled points, is Ug = 2t(2)/A, where the subscript S stands for sampling 
and fl for the direction in sky, {6, (p). If we denote the LIGO detector at Hanford by H, the LIGO detector at 
Louisiana by L, and the VIRGO detector by V, then for the two-detector networks formed from pairs among 
these, we have n^{LH) - 40, n^{HV) - 108, and n^{LV) - 105. 

• Three- detector network: 

In the case of a network with three detectors, we once again take the first one (i.e., / = 1) to be fiducial. Also, 
we can always imagine all of them to lie on a single plane. In such a case, there arise two nontrivial time-delays, 
namely, T(2) and r(3). The allowed values of these two time-delays are easily shown to be restricted within a 
bounded region of a plane; this region is circumscribed by an ellipse. Any point in this region represents a pair 
of time-delay values, (t(2),T(3)), corresponding to a given pair of values for the source-direction angles, {9' ,(j)'}. 
The equation of this ellipse is given by 0] 

{r(2)f + {T(3)f{p^ + q^) - 2pT(2)r(3) - q\d^2/cf - , (4.24) 



''Alternatively, one may first select a consistent set of T(7)'s and then deduce {9' , 4>'} from them. 
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fig.(a) 




fig.<b) 





FIG. 2. Network template for a two-detector network. We choose the fiducial frequency as fs = 40 Hz. The instant at 
which the signal reaches fs is shown by a vertical line. We take the chirp time corresponding to the fide to be ^ = 0.3 sec. 
The dots represent the padding with zeros, which is introduced before and after the signal. The two panels in (a) depict the 
two individual detector-templates constituting the network template. These two panels correspond to detectors with different 
seismic cut-offs, viz., fs(i) = 33 Hz and fg(2) = 40 Hz, respectively. The respective chirp times are ^(ij = 0.5 sec and ^(2) — 0.3 
sec. The padding before (i.e., on the left-hand side of) the signal is of a duration Td = 50 ms. The part of the curve for detector 
2 that is shown in dots and dashes is ineffective in contributing to the SNR. Panels (b) show the relative positions of the 
signal in the individual detector-templates for which ||C|| has a maximum when the second detector has a relative time-delay 
of (t(2) — ''"(1)) ~ 20 ms. Each of the detectors has its seismic cut-ofi' equal to 40 Hz, which gives ^(i) — ^(2) = C — 0.3 sec. 



19 



where p = (^13/^12) cosa23 and q = (^13/^12) sin 023, with di2 (or ^13) being the distance between the first and 
the second (or third) detec tors. Also, a23 is the angle subtended by the hubs of detectors 2 and 3 at that of the 



first detector. From (4.24), the "area" of the elliptical window, given in terms of the number of time-sampled 



points, is Tig = 27r X A/((?/S?) where A is the area of the triangle formed by the hubs of the three detectors in 
the network. We find that the number of possible time-delays for the three-detector network of LIGOs- VIRGO 
{LHV) is ng - 3 X 10^. 

• Four and more detectors: 
In the case of a three-detector network, the two time-delays produce two circles on the celestial sphere that 
intersect at two points, which give the possible directions to the source. This two-fold degeneracy is broken 
when we introduce a fourth detector lying outside the plane of the three detectors. In such a case, the possible 
time-delays can be represented in a three-dimensional space of the three time-delays, which now lie on the surface 
of an ellipsoid. The number of possible time-delays is exactly doubled compared to that of the three-detector 
network. Thus, its maximum possible value is «§ = 47r^/(c^A^), where A is the area of the smallest of all 
possible triangles formed subsets of three of the four detectors. When there are more than 4 detectors there 
is redundant information on the direction to the source, but the n§ is the same as for four detectors. In the 
presence of noise this redundant information may be used to reduce the errors in the direction to the source. 
Here, we do not pursue this point any further. 

The sampling interval naturally provides the most simplistic discretization in carrying out the search in time-delays. 
In searching over 6 and (j) one does not need to compute additional Fourier transforms; rather, one needs to combine 
the individual detector-correlations with the correct time delays to construct the optimal statistic, L. This gives rise 
to two components to the computational cost: the cost involved in computing Fourier transforms and the cost arising 
due to the arithmetic operations involved in computing L over all possible time-delays. As shown later, the latter 
cost can be considerable and may dominate over the cost in computing Fourier transforms while searching over ^. 

It is important to note that sampling can introduce an arbitrary mismatch between the actual source direction, 
{0, (j)), and the direction in the template, {6' , (/>'). The mismatch, /x, is the fractional loss in SNR when the signal 
and the template parameter differ slightly. In agreement with most investigations carried out so far, we decide to 
tolerate a mismatch to a maximum of 3%. The sampling can lead to a mismatch either greater or smaller than /i. 
If the sampling gives a mismatch less than the desired one, then this simplistic procedure of scanning W leads to 
unnecessary extra computational costs. On the other hand, if the mismatch is more, then one is likely to miss out 
more events than desired. The question whether the sampling is adequate one way or the other can be resolved by 
constructing a template-bank for the desired mismatch /i. In the next section, we proceed to construct a bank of 
templates in the parameter ^, 9, and 4>. The template bank, in general, will produce time-delays that do not fall 
exactly at the sampled values of the correlation vector. However, we can easily interpolate to obtain the intermediate 
values by applying Shannon's theorem p9[ , which essentially states that a band-limited function can be constructed 
in the time domain from its discretely sampled values at the Nyquist rate. Or in other words, the template bank 
provides the rate at which the output can be re-sampled so as to obtain the desired maximum mismatch. 

V. TEMPLATE BANK IN ^, 6, AND cj> 

Recall that the LLR is a function of eight parameters, namely {r,6c,e,'iJj,tc,S,,d, ((>} for the Newtonian chirp. As 
mentioned earlier, we adopt the maximum-likelihood method for the detection problem. It implies that the LLR 
must be maximized over all the parameters to obtain the MLR. We have shown that the maximization of LLR can 
be carried out analytically with respect to the four of the eight parameters, {r, (5c, e, V'}- Also, we can deal with the 
time of final coalescence at the fide tc efficiently by using the FFT algorithm. Therefore, we now need to formulate 
a strategy to search over the rest of the 3-D parameter space formed by £,, 9, and (j>. Ideally, one should scan the 
whole range of the 3-D parameter space over the physically allowed parameter values. This, however, is impractical 
due to computational limitations. Therefore, a prerequisite for such a maximization is an estimate for the magnitude 
of grid discretization. The grid spacing in the parameter space depends upon the fractional loss in the SNR that one 
is prepared to tolerate. 

To estimate the number of templates in the 3-D parameter space, we take the differential-geometric approach |3Q] 
and use Owen's method of introducing a metric on that space |23] and extend his formula for the one detector case to 
that of the network. Also the inverse of the metric is just the covariance matrix scaled by the square of the SNR. Thus 
the metric also provides information on the errors in estimating the parameters. In this geometric method, the signal 
vector is characterized by P -I- 1 parameters, i?", where a — 0,1, ...,P. The signal vector lies in a (P -f l)-dimcnsional 
manifold denoted by V. We define the metric on V by g^p, which is related to the fractional loss in SNR, denoted by 

20 



/i, when there is a mismatch between the signal and the template parameters. Since here we consider the Newtonian 
chirp as our signal, we have P = 3, i9'^ = tc, and ^* = {^,9,(f)}. As L can be maximized over tc numerically via the 
FFT, we only need to lay the templates in the rest of the P-dimensional parameter space, comprising of {^, 6, (/>}. In 
other words, we need to compute the metric 7,., in the P-dimensional subspace. It is determined by projecting the 
metric gap onto the subspace orthogonal to tc- We then obtain 

Iv = 9tj ■ (5-1) 

ffoo 

The number of templates is obtained as follows. We compute the proper volume of the parameter space with the 
metric jij and multiply the volume by the number density of the templates. Fixing the value of /i determines the 
grid spacing of the network templates in the parameter space. The number density, pp{fi), which is the number of 
templates per unit proper volume, is given by 




(5.2) 

It is defined to be uniform over the whole parameter space and, therefore, its use is applicable as long as the curvature 
of the astrophysically interesting region of the manifold described by 7ij is sufficiently small, and the effects arising 
from the boundary of the region are negligible. 
The total volume of the parameter space is 



V = / ^det II jij II d^^ . (5.3) 

Thus, the total number of templates required is , nj = V x pp{fJ.) ■ In general, the total number of templates depends 
on the source parameters {e, ip}. In order to scan the parameter space (for a given mismatch fi) for each pair of values 
{e, ijj}, we must maximize the volume V over {e, ip}. This is tantamount to choosing the finest bank of templates. For 



simple cases, this is straightforward and has been implemented in some examples in Sec. VI. In general, however, 
such a maximization is non-trivial to perform. 

A. The network metric 

We apply the method described above to obtain the metric in the four-dimensional parameter space {tc,^,0,i 



When the parameters of the network template and that of signal mismatch, the network statistic given by (4.19) 



drops below the maximum value. The metric gap defined on this four-dimensional space is related to the amount of 



drop in the statistic, L, and is obtained by expanding the statistic about the maximum. Using Eq. (4.19) the squared 
statistic can be rewritten as, 

L2 = |v+' . C|2 _,_ |v-' . C|2 

= p'/C'C*j. (5.4) 

The quantity p'j"' is a projection tensor given by, 

Pj' ^v V J +v Vj , (5.5) 

which projects a vector in C^ on the helicity plane spanned by Q'^ . It obeys the identities 

P/Vj^=p;^, p'/p'/ = 2 , (5.6) 

which are consistent with its being a projection tensor on a two-dimensional plane. The primed coordinates refer to 
the template. 

Let -d" and ■(?'" = '&"' + Az?" be the parameters corresponding to the signal and the network template, respectively. 
For computing the metric one takes normalized templates for the signal as well as the template, so that the maximum 



value of L is unity when the parameters of the signal and template match. In the absence of noise, Eq. (4.4) yields 



C; = (5'^/)(,)C.e'^=Ql(5'^5^)(,) , (5.7) 



21 



where, the S'^ denotes the template. The above expression is exact within the SPA. So the statistic can be written 
as, 



where 



i^=py'Q'Q*,Qii)ij) , 



dM(j)(/)exp(i$(j)(/;z?^Ai9^)) 



(5.8) 



(5.9) 



is the product of the individual ambiguity functions of the I-th and J-th detectors. It is a measure of how distin- 
guishable the two wave-forms, i.e., the signal and the template, are. Here, 



A/)(/) 



1 



/ 



-7/3 



(5.10) 



which satisfies the normalization condition J, •A.{i){f)df = 1. Note that in the limit of A9" -^ the projection 

tensor of the filter is same as that of the projection tensor of the signal, i.e. p'j'^ — > pi'^ . In this limit, the projection 
tensor of the signal obeys the relation pi^ Q^Q*j = 1 and noting that S'^'s are normalized to unity, we can see from 



Eq. (1^ that L^ ^ 1 as desired. 

The correlation phase, $(/)(/; i?^, Ai9^), is given by (see Eq. (3.11)): 



$(,)(/; I?'', A^^) = *(,)(/;/„i„C)-*(/)(/;/„i^,^') 



(5.11) 



The correlation phase includes the contribution from the differential time-delay between the signal and the template. 
Instead of using (0, cj)) to specify the direction to the source we use the components ni and 71-3 of the unit vector 
n = {ill, n2, n^) to do so. The time delays in units of f~^ are, 



/sT(/)(n3,7ii) = r(/)ini +r(/)2(l~n3 ~ niy/"^ + r^^if^ns 



(5.12) 



where r^/j is the position vector of the /-th detector's hub and is henceforth measured in units of the "fiducial 
wavelength". As = c/fs- Since we choose to measure the time delays with respect to the fide, we must have r(^) = 0. 
Thus, we may write the correlation phase as 



$(,)(/;^?^ At?'') = 2^^(,),(/; ^'^)A^" , 

where -d" = {/sic, fs^, "-s, ni} is a quartet of dimcnsionless parameters and 

-5/3 



(5.13) 



'^(7)a(/;^") 



n 3// 



r{i)3-r(i)2- 



ns 



l-n?-n?)'/' 



Is 



r(i)i - r(/)2 



ni 



il-nl-nlf' 



Is 



(5.14) 



To obtain ga/j we Taylor expand the network statistic about the peak at Ai?^ = to obtain 



^('''^'')"^ + KmSa^) 



M°'M'^ 



A1?=0 



where i9 is the four-dimensional signal parameter-vector. Note that the first-order term in the Taylor expansion gives 
vanishing contribution. This is because L has a maximum there at Ai? = 0, 






, 



(5.15) 



Al9=0 



for any a. Then, the metric gafii'^, At?) is defined as 
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(5.16) 

Al9=0 



The above differentiations can be performed. But first we study the effect of a mismatch of signal and template 
parameters on the network statistic. 

For a perfect match between the signal and template parameters, the correlation vector C ex Q, lies in the sig- 
nal helicity plane Ti.{9,(f>). When mismatched, however, each component of C gets multiplied by the weight factor 
(S"^, 5'^)('7), i.e., C^ = Q^{S'^, S^)*^jy Since (S"^, S^)*^j-^ depends on the noise PSD of the detector and the time delay, 
which are different for each /, the components of the correlation vector get scaled differently for each I which makes 
the vector C move out of TC. Owing to this mismatch C may lie outside TL{9,(I)) as well as TL{9' ,(j)'). However, the 
maximization over e and '0 requires projecting C onto the template helicity plane T-l{6\(f)') in order to obtain the 
network statistic, || Cn \\. Thus, the value of the computed || C-h \\ can decrease due to two effects: 

(a) reduction in the norm of C, 

(b) C moving out of the signal helicity plane. 
We assume that the orientation of the helicity plane changes slowly as compared to the effect of the time delays. 



This means that we treat p'j'^ as effectively constants in (5.8), and equal to the corresponding tensor for the source 
parameters, namely, pi'^ . The validity of this assumption, to a good approximation, is supported by the extensive 
numerical computations that we have performed for the networks and parameters that we have considered. Since we 
consider the mismatch to be quite small (3%), the templates are closely spaced in the direction angles and hence the 
approximation is valid to about few parts in 10'^ or even better. Thus, from Eqs. (5.8) and ( 5.16| ) we obtain the 
metric to be 

9M^)-\pi-'m'Q*j]9ii)(J)c.p , (5.17) 

where we used the fact that both pj and g{i)[j)ap are symmetric under the interchange of / and J. Also, g[i)(j)ap = 
— (9^0(/)(j)/9At9"9Az9'^) |ai?^=o. The reality of gap is now manifest. Owing to the linearity of $ in A^", the metric 
gafi depends only on its first derivatives. Therefore, it can be easily shown that 

5(/)(J)a/3 = ($a$;3)(/) + ($a$/3)(J) " (*a) (7) ($/3) (J) ~ ($^) (/) ($a> (J) , (5.18) 

where the suffix a denotes the derivative with respect to Az9" . The angular bracket denotes the average over a given 
frequency range. For the frequency range of [/s, oo], the average value of the function X(/)(/) is denoted as. 



(X)(,)= / ^(,)(/)X(,)(/)d/ , (5.19) 

where within the angular brackets we have dropped the subscript on X simply because the same subscript appears 
outside those brackets. In other words, we have reduced redundancy by introducing the notation: {X)ij\ = {Xij\)ij\ 



We observe that (5.18) is a generalization of Owen's formula in Ref. |23|, wherein the metric for the single-detector 
case was derived. It is not difficult to understand the origin of the different factors in the expression for g^p- This 
metric gets contribution from every pair of detectors in a network, including the diagonal terms (i.e., terms with 
/ = J), through the "coupling" metric 5(/)(j)a/3. The magnitude of each of these contributions is determined by their 
respective coupling strengths in the form of coefficients, p/"'5ft[(Q^Qj)], which depend on the four angles {e, "0, 6*, 0}. 
This is because these coefficients essentially arise from the extended beam-pattern functions of the detectors, which, 
apart from depending on the signal amplitude through e, determine how sensitive a given detector is to a source 
direction and i/'. 

The above expressions allow one to calculate the parameter space metric for any Earth-based network. However, 
since the metric is non-flat (as opposed to a flat metric for a single-detector "network"), the template spacings Ai? 
will depend on the location, ■j9, of the template. The general expressions for the moment functionals are: 

(*a)(/) = 27r(/?(/)5aj(/)(7-3TOa) , (5.20) 

where (p(^j)sa = V(i)aif{i)sl ^") and ttIq is the power of / on which <y5(/)Q depends and j(^i){q) is q-th noise moment of 
the noise-curve corresponding to the I-th detector and is deflned in appendix O. Similarly, 

($a$/3)(/) = 'i'^^(P(I)sa'P{I}sl3J{I)i''' - 3(^0 + TTlp)) . (5.21) 

In terms of the above expressions, the coupling metric is 
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9(I)[J)al3 = 47r^{ [P(I)saV{I)sl3J(I)(J " 3(r7la + nifj)) 

- ¥'(/)sQV(J)s/3J(/)('^~3rna)j(,7)(7-3TO^)] + [/^ J] } . (5.22) 

Another form of the parameter-space metric that proves useful in later computations is obtained by taking pro- 
jections on the vectors Dp {p = ±2), which were first used in Eq. (^). Such projections allow us to isolate the 
dependence of the metric on the parameters {e,'0}- This simplification arises because Q lies in the plane of 0+2 and 
D_2- To see this, let us define the following four quantities, 



-;PijD±2D±29(i){j)afi ■ (5.23) 



ff±2,±2a/3 ^ 

It follows from aforementioned properties that 

9-2,2a/3 ~ g2,-2afS, 5-2,-2q/3 — (ff2,2a/3)* , (5-24) 

and that g-2,2af3 is real. The metric is then given by 

111-112 l+COS^e l-COS^e 4^ Zr nrN 

II E Ir <?a/3 = ^ .9-2,2a/3 + ^ ^[e ^'"^ 92,20. p] ■ (5.25) 



We make use of this form in computing the cost for the LIGOs- VIRGO network. 

Before we proceed to the discussion of various cases of networks, we mention a scaling property of the phase <I>. If 
$ is scaled by a constant a, i.e., if $ = a$, then the components of the metric get scaled by a? , viz., 7^^ = a?'ya0- 
Assuming the dimension of the search parameter-space to be P, one finds that det || ^ap \\— a^'^ det || '^ap \\- Thus, 
the proper volume scales as V = a^V. 

We now obtain the errors in determining the direction to the source. The errors are obtained via the covariance 
matrix. Note that 7 is the metric on the three-dimensional parameter subspace with {^, 9, (/)} as the coordinates. From 



a statistical point of view, 7^^ is the Fisher information matrix for a signal of unit amplitude |27|. The covariance 
matrix for a signal of unit amplitude is just 7*^, the inverse of ^ij. The covariance matrix for a signal of arbitrary 
amplitude is obtained from 7*-' by dividing it by the square of the signal amplitude. The diagonal elements of the 
covariance matrix are the variances in the errors of the estimated parameters. The errors in the estimates of the 
parameters are given by the corresponding standard deviations. 



T(,)^^ , (5.26) 



where b is the amplitude of the signal. In the next section, we list crji) in Table II] for various networks 



B. Computational costs 

We now estimate the computational cost involved in searching over the parameters 9, (j) and ^. We consider data 
trains of duration T seconds sampled at the rate A^^. The number of sampled points in each of the M data trains 
is denoted by A^ = T/A. As remarked before, the computational cost involved in obtaining the statistic has two 
important contributions: 

• The cost involved in computing Fourier transforms, denoted by C^^. 

• The cost in computing the optimal statistic while searching over the time delays or, analogously, over the 
source-direction angles, {0, (j)}. We denote this cost by C^, where Q, = {9, </>). 

Also, we consider two different ways of searching over the time delays: 

1. Scanning all the time-delays in a window, W. 

2. Using selected values of time-delays from the template bank. 

For the quantities relevant to (1), we attach a subscript S (for sampling) and to (2) a subscript B (for bank-of- 
templates). Let the number of templates required in searching over ^ in these two methods be denoted by rig and 
rig, respectively. Then n^ is obtained by placing the templates in the P-dimensional space, while rig is obtained by 
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placing the templates just in the ^ coordinate. Thus, n| is the proper length associated with the parameter range of 
the ^-coordinate, multiplied by pi(/i) = l/{2,Jp). This yields, 

n% = ^\/7n(Cmax - Cmin) • (5-27) 




For detectors with identical noise curves, rig is just the number of templates required for searching over ^ in the 
one-detector case for a given coalescence phase, 6c- The quantity rig for a detector network, with all detectors having 
identical noise, is just VP times the number of templates in ^ for a single detector. The factor of \/P comes from 
the fact that the template must reach out to signals that mismatch with templates in all the parameters. For a 
two-detector network, since the bank is in ^ and 6, we have P = 2. For networks with three or more detectors, the 
search is over three parameters and, therefore, P — 3. 

Let us assume that the FT of the templates are stored in memory and the FT of the data has been taken. Then 
the computing cost in FT for the two cases is [ p8[ , 

C^^B = G4,BMNlog^N , (5.28) 

where we have included a factor of 2 for the two sets of templates corresponding to Sc = 0, 7r/2. We assume that 
the vectors \/±{0,(j)) are stored in the memory for each pair of (0, </>) in the window/template bank, typically few 
thousand for the LHV network. The number of real floating point operations ("fl-pt ops" for short) for constructing 
the statistic A is 8M -I- 3 (which essentially behaves as 8M when M is large) for each point in the sky, {9, (f)). Hence 
the costs in two cases are, 

C§ ~ 16MiVngn| , 

eg ~ SMNntot = l6MNn%n% , (5.29) 

where ritot = 2n§rijj. This equation defines ri§. The total costs are given by, 

CsM = 2n|^A/iV(8ng B -I- 3 loga N) . (5.30) 



In (5. 30]), we have ignored the overhead costs. Also, in the template bank case, we have ignored the cost involved in 



computing C at non-sampled values, which can be obtained from Shannon's theorem. The length of the data that 
is effectively processed is equal to the length of the zero padding. This is because if the time-of-arrival of the signal 
exceeds the padding duration, the longest chirp can extend out of the data train. Thus we need to consider times of 
arrival only up to the padding length. The next data train must begin at this instant leading to an overlap between 
successive data trains. To process the data online requires that the processing rate be at least equal to the rate of 
data acquisition. Since the length of the processed data is just the length of the padding with zeros in the template, 
the online computing speed ^5 and Sb are given by dividing the computing costs by the padding duration, 3^i. In 
the next section, we obtain the template bank and the computational costs for various networks. 

VI. EXAMPLES OF DETECTOR NETWORKS: RESULTS 

Before considering the case of the actual network of laser-interferometric detectors being built around the globe, 
we shall first consider some idealized cases that are simple to analyze. Such an exercise is meant to provide us with 
some useful estimates on the number of templates required, computational costs, etc. We begin by verifying that in 
the case of a single detector the formalism developed in the previous sections yields the results expected from earlier 
studies [ P2|j2^ . Then we apply our formalism to cases of networks with two and three identical detectors, respectively, 
with identical noise PSDs. Assuming a common noise simplifies computation of the metric on the signal parameter 
space. Purely for the purposes of obtaining estimates we choose the noise PSDs in these cases to be that of LIGO-I, 
with fs = 40 Hz. Here, cases of non-coincident as well as arbitrarily oriented detectors are also studied. Subsequently, 
we generalize these analyses to obtain estimates for realistic cases of networks comprising of the LIGO and VIRGO 
detectors: These include two-detector networks, which pair up the two LIGOs or VIRGO with one of the LIGOs, and 



the three-detector network that includes VIRGO and both the LIGOs. In Tables ^ and |I^ we summarize our results, 
which include the computational speed requirements, network sensitivities, and source-direction resolutions for all of 
these networks. 
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A. The one-detector 'network' 

It is instructive to start with the one detector case since it lays the foundation for the M-detector case, the analysis 
of which would be the final goal. More pertinently, the number of templates in ^ for the single-detector case is required 
for computing nj in the case of networks with more than one detector. We first verify that our solution gives the 
expected estimates for M = 1. For a single detector, we have i?° = fgtc and d^ = fs^- The network statistic is 

L-|QQi| = |Ci| 

^(/)expH<i>(/;^?^A^?^))d/ , (6.1) 

fs 



where the phase $ can be derived from Eqs. (5.11) and (3.11) by setting I — 1 and the time-delay term to zero in 
those equations, respectively. 



For the above statistic, the exact metric gap can be obtained from Eq. (5.16). Using the scaling <!> — 27r$, the 
scaled metric is 

5c./3 = ^[(l'a$/3)-(*a)('l/3)] , (6.2) 

where the moment functionals can be expressed in terms of the noise moments listed in Appendix |y. The metric then 
reduces to 



where ki, fc2, and ks are certain useful combinations of noise moments and are defined in Eqs. (p4|) (here we have 
dropped the detector index / from those combinations for obvious reasons). Projecting orthogonal to i?", we find 

The parameter-space in this case is just one-dimensional. Its volume is the proper length 

= V2Tr fsk{^„iax - fmm) ■ (6.5) 

Since the number density of templates here is pi(/i) = 1/(2^//!), the number of templates (including 2 sets of templates 
for searching over Sc) is just 

ntot = \ -T^fski^max - £.mm) ■ (6.6) 

V M 

For LIGO-I noise we have k = 0.062, and the corresponding ntot is given in Table || for the parameter ranges listed 
there. 

B. Two-detector networks 

(a) Two non-coincident, identical detectors witti identical noise PSDs and identical orientations 

We consider a network of two identical detectors with identical noise PSDs. We make the following choice of 
coordinates. We choose one of the two detectors to be the fide. The z axis of the fide is chosen along the line joining 
the two detectors. Then the second detector is taken to be located at (0, 0, Z2), with an orientation identical to that of 
the fide, i.e., a(2) = /3(2) = 7(2) = 0. Owing to the same orientations, the beam-pattern functions of the two detectors 
are identical. If the detectors were located at the same place, then the resulting network would have mimicked a single 
detector, but with a higher sensitivity. Here, however, we consider spatially separated detectors, where the relative 
time delay, T(2), provides partial information about the source-direction, namely, 9. The time delay in units of f~^ is 
given by 
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fsT(2) == Z2(z- n) , (6.7) 

where we measure Z2 in units of the fiducial wavelength, As (which is ~ 7500 km for fg = 40 Hz). Thus, the network 
statistic in this case is 

L=|C-Q'| (6.8) 

= ±\C:+C;\. (6.9) 

Note that Q'^ = Q'^ and \Q'^\ = \Q'^\ = l/\/2- This means that we have no information about e and ip. For a 
two-detector network, any given value of the time delay corresponds to more than one source directions, all of which 
lie on the surface of a cone whose axis coincides with the line joining the two detectors. Only when the source lies on 
the line passing through the two detectors is the time delay single- valued, and is of maximum magnitude for a given 
pair of detectors (note that we have allowed T(7) to be negative as well). The value of the time delay r(2) determines 
the opening angle of the cone. Thus, the azimuthal direction angle of the wave remains undetermined in the case 
of two detectors. Only 9 can be estimated from the time delay that appears in the phase difference of the detector 
responses. 



As in case (a), the exact metric gaf3 for the statistic ( |6.8[ ), can be obtained directly from Eq. (5.16). The corre- 
sponding scaled metric is 



~9<^P^jY.l[^^'^^f^hi)-i^c.)ii){^p)ij)] , (6.10) 



4^-^ 2 
I, J 



where the $(/) are defined in Eq. (5.11). The metric on the three-dimensional parameter space {fstcT fs^ii^s} can 



now be given in terms of the noise moments as 

2 / fci fcs Z2fcl/2 \ 

gai3 = :^ { ks fc2 22/03/2 , (6.11) 

^\z2ki/2 Z2k3/2 zi{ki+j{l))/4 J 

where j(l) is a noise moment defined in Eq. (|C^).p^ After maximization gap over the time of coalescence, we deduce 
the metric 7^ to be 

^ I f {k^k2~k|)/k^ \ 

Here, a vanishing 712 implies that there is no covariance between ^ and 9. This is however not true when the noise 
curves are assumed to be different for the two detectors. This is so in case (c), which we discuss below. 

The volume on the 2-D parameter space {fs^, n^} is obtained by integrating -y/det || 7 || over n^ and ^. The proper 
volume of interest is 

V^i27:)^V^27:^f,k'z2{Ua.-Crmn) , (6.13) 



where k' = -\/j(l)(^i^2 — ^D/^i- For the two-detector network, the number density of templates is P2{l^) = 1/(2(U). 
Therefore, the number of templates is 

2tt^ 

ntot^ fsk' Z2{£.„iax ~ S.rmn) ■ (6-14) 

For LIGO-I noise, k' = 0.288. The value of ritot for this case is given in Table ||. 



^'^We have dropped the detector index / from the noise moment combinations fci, fc2, and fcs (see Appendix O since here the 
noise moments are identical for the two detectors. 
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(b) Two non-coincident, identical detectors witti identical noise PSDs, but with different orientations 

We make the choice of coordinates identical to that in case (a) above. Since the two detectors have different 
orientations, the beam-pattern functions for the two detectors differ, i.e., Q^ ^ Q^ . This has the imphcation that more 
information about the signal parameters, namely, e and '0, can be obtained. Since here we have only two dimensions 
on the network space to contend with, the network correlation- vector, C, always lies in 7i and no projection is required. 
Therefore, the problem of maximization of the LLR over the angles {e, t/"} reduces to aligning Q' along C. Thus, the 
network statistic simplifies to 

L=|| C|l= (|Ci|2 + |C2|2)'/'. (6.15) 



The metric gap on the parameter space with coordinates {fstc, fsS., n^} is obtained from Eq. ( 5.16 ) to be exactly 



~9o.p = ^J2\Q'\'[(^-^p)w-i^c,)ii){^p)ii)] , (6.16) 



I 



where the $(/) are the same as in case (a) above. Setting \Q \ = rj and, therefore, \Q \ ~ 1 — rj, the metric in terms 
of rj and the noise moments is 



1 



fei fcs 2:2(1 -77)fc 



ffa/3 = :r ^3 fca 22(1 -17)^3 . (6.17) 
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Z2{1 ~ ri)ki Z2{l-rj)k3 Z2(l - 'n)k 



The associated jij is then given by 



~ _ 1 f {kik2-kl)/k, ^ , . 



We note that the -^/det || 7ij || depends on rj, which is a function of source-direction {6, (/>) as well as the angles 
{e, V'}- Thus, for every single source-direction we have a two-parameter family of metrics 7^ (dependent on e and ip). 
Clearly, the template spacings in this case will vary with their locations. For the purpose of obtaining estimates, we 
perform a simplification by opting to choose a bank of templates that is the finest over these two parameters. To do 
so, we first maximize det || 7ij || over e and ip and then compute the volume. The parameters {e, V'} appear in the 
determinant only through the factor y^ri{l — rj). The value of 77 for which the determinant is maximized is 7/ = 1/2. 
We prove in Appendix O that one can always find a physically allowed pair of {e, ip} such that the value rj — 1/2 is 
attainable for any given pair of {0, 0} and for any orientations of the detectors. We use this value of 77 to compute 
the parameter-space volume. 

The proper volume after multiplying by the appropriate scaling factor is 

V = 2^^^k"MUax-U^n)z2■ (6.19) 



where k" = \/kik2 — fc|, which is equal to ~ 0.13 for LIGO-I noise. As in case (a), the number of templates is arrived 
at by multiplying the proper volume by the number density of templates. 

(c) General case of two detectors 

Here, we typically consider the case of a network comprising of the VIRGO detector and one of the LIGO detectors, 
say, the one at Louisiana, for concreteness. (The results do not differ much if we replace in our calculations the 
numbers corresponding to the detector at Louisiana with those describing the 4 km long Hanford detector.) We 



assume the noise curves of the respective detectors to be those given in Table IV |3l|] . Here, we have a case in which 
the seismic cut-offs are different. Labeling the VIRGO detector as 1 and LIGO as 2, we have /^(i) = 16 Hz and 
fs{2) — 40 Hz. The important implication of this is that the signal in the VIRGO detector will last longer by a 
factor (/s(i)//s(2))~^^^ ~ 11.5. Thus, till the chirp reaches the frequency of 40 Hz, essentially it is only one detector, 
namely, VIRGO that contributes to the SNR. The longest chirp (for mi = ?7i2 « O.5M0) in the detector output lasts 
for about 1588 sec. Accordingly, we choose the data segments to be of duration 5000 sec. each, and assuming the 
same sampling rate of 2 kHz for both detectors, we have N ^ 10 . We now compute the metric and the number of 
templates for this network. 
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The expression for the scaled metric in this case is the same as the one in Eq. ( ^.16 ) . Let Z2 denote the distance 



between the detectors as in case (a) and let jQ^p = 77 as in (b). Also, we take the fiducial frequency to be /s = fs{2), 
without any loss of generality. Then the metric is given by 

fcl(2) +'7(£'^fcl(l) - fcl{2)) ^3(2) +77(£l"^''^fc3(l) - ^3(2)) {'i- ~ V)z2ki(2) 

9af3 = 7; \ k3(2)+ viQ^'^^^ h(l)-h(2)) k2(2)+V{Q'^°^''^k2{l)~k2(2)) (1 - ??)z2fe3{2) | , (6.20) 

(1-77)2:2^1(2) (1 -»?)22fc3{2) i'^ - ri)ziki(2) 



where we have used £i(i) — fs{i)/ Is{2) = Q (see Eq. (C5)) and gi(2) = 1- For the network under consideration g = 0.4 



and fci(/), fc2(/), k^i) for VIRGO and LIGO-I are hsted in Tabic IV. From Eq. (5.1) we compute 7ij. We find that 
Vdet II 7y II = Z2B{i]), with 

^ [7?(0.23 + 13.3377 + 6.5177^ - 20.077/3)]i/2 
^"^^ 4.57 + 6.9577 ' ^ ' ' 

which is a smoothly varying function of 77 that attains its maximum value of ^ 0.22 at 77 '--^ 0.6. 
The volume of the parameter space is 

V~1.76^T^Z2fs{Uax-U^n) , (6.22) 

and the corresponding number of templates is 

^^tot — Z2J sKsraax t,niin) ; (D.zoj 

A* 

where ntot is listed in Table |l[ 

We do not specify the computing speed for the sampling method in this case because it is not clear what noise 
curve one must choose to obtain one detector templates in ^. 

C. Three-detector netw^orks 

(a) Three non-coincident identical detectors witti identical noise PSDs and identical orientations 

We consider a network of three detectors with identical noise PSDs. The detectors are spatially separated and have 
identical orientations. Such a situation will be difficult, if not impossible to realize on a spherical Earth. However, in 
this simple case our goal is to obtain order of magnitudes estimates for computational costs, etc. We treat one of the 
three detectors to be the fide. We choose the coordinate system of the fide as follows: The z-axis of the fide is along 
the line joining the fide and one of the remaining detectors. Thus, the second detector is located at (0,0, 22)- The 
X-axis is chosen such that the plane formed by the network coincides with the x — z plane. The spatial coordinates of 
the third detector are (0:3,0,23). The detectors have identical orientations, i.e., afj\ = = /?(/) = 7(/). Hence, they 
have identical antenna-pattern functions, i.e., Q'^ = Q'^ = Q''^ . Then, the network statistic simplifies to 

L=|C-Q'| 

^_L|c* + C2* + C3*|. (6.24) 

Note that, the network of three spatially separated detectors provide two independent relative time delays, T(2) and 
T(3), which determine the two possible source directions as follows. For each pair of detectors in such a network, the 
time delays draw a circle in the sky for possible source locations. The intersections of two such circles determine two 
possible source directions. Here, the time delays in units of f~^ are fsT[2) = 2^2(2 • n) and /sT(3) = a:3(x • n) -I- 23(2 • n). 
We measure Z2, 23, and 2:3 in units of fiducial wavelength As. 
The exact scaled metric is obtained via Eq. ( 5.16| ) to be 



i,j 



where the $(/) can be obtained from Eq. (5.13) by using for the time delays, /sArjij — 0, /sAt(2) — Z2A7^3, and 
/sAr(3) = X'il^ni -f Z3A7J3. The resulting components of the symmetric metric ga|3^ on the space of the variables 
{Istcjsi,n^,ni}, are 
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/ fci 


fcs 


i 




fca 
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V ■ 





9a0 

Maximization over fst,. gives 
Then the proper volume is 



(2:2 + 2:3)^1/3 xafci/S 

(22 + 23)fc3/3 a;3fe3/3 

[3(2| + 2|)j(l) - (22 + 23)2j(4)']/9 X3[323j(l) - (22 + 23)i(4)2]/9 

a;i(3j(l)-j(4))/9 






(6.26) 



(fcifc2 — k^)/ki 





V = {2TifV = 





2(2! +zl- 2223)j(l)/9 2:3(223 - 22)j(l)/9 

a;3(223 - 22)i(l)/9 2xij(l)/9 



167r4 
3V6 



Hl)kAfs{in 



srnin) ? 



(6.27) 



(6.28) 



where A is the area of the network. For the three-detector network discussed here, the parameter space is three- 
dimensional and the number density of templates is psin) = 3\/3/(8/i'^"). For typical parameters the quantities of 
interest are listed in Table pi 

(b) The network comprising of LIGO (Livingston), LIGO (Hanford), and VIRGO (Pisa) with their respective noise curves. 

We finally discuss the case of a non-coincident three-detector network involving the LIGO and VIRGO detectors 
with their respective noise curves. We denote such a network as LHV. The detector noise PSDs are represented by 
analytical fits given in Ref. Ipll . These fits are reproduced here in Table IV . We assume LIGO-I noise in both of 
the LIGO detectors. Table | lists the locations and the orientations of the detectors on the globe 132]. In order to 
compute the metric we choose the fide frame to be the network frame with L at its origin, H lying on the 0-axis, and 
V lying in the x — z plane. In units of As, the dimensionless position- vectors of the detectors are given by 



rL = (0,0,0) 



rH = (0,0,0.40) 



ry = (1.05,0,0.29) 



(6.29) 



Note that since the j/-component of each of these vectors is zero in such a frame (i.e., '''(7)2 = 0, for I =1, 2, 3), the 



moment functionals given in Eq. (C5) simplify considerably. An inspection of that equation shows that in this case 
the noise moments do not depend on the direction to the source and, hence, the g^nujafs s-re constants. Therefore, 
this choice of the fide frame simplifies the computations involved. The metric Oq,^ however, depends on rii and n^ 
through Q. The orientations of the detectors can easily be obtained from Table |. In the fide (network) frame, they 
are given by 



{aL,/3L,7L} 

{aH,(3H,'-fH} 

{av,Pv,lv} 



{38.11°, 256.35°, 107.43°} , 
{38.09°, 283.54°, 196.88°} , 
{320.34°, 275.92°, 159.02°}. 



(6.30) 



TABLE I. Locations and arm orientations of Earth-based interferometric gravitational-wave detectors. The length of each 
arm is given in meters. The location of the corner station of each detector is given in terms of the latitude and longitude there. 
The orientation of an arm is given by the angle through which one must rotate it clockwise (while viewing from top) to point 
north. 



Project 


Location 


Year 


Length (m) 


Corner 


location 


Arm 1 


Arm 2 


TAMA-300 


Tokyo, JPN 


1998 


300 


35.68°N 


139.54°E 


90.0° 


180.0° 


GEO-600 


Hannover, GER 


1999 


600 


52.25°N 


9.81°E 


25.94° 


291.61° 


VIRGO 


Pisa, ITA 


2000 


3000 


43.63°N 


10.5°E 


71.5° 


341.5° 


LIGO 


Hanford, WA, USA 


2000 


4000 


46.45°N 


-119.41°E 


36.8° 


126.8° 


LIGO 


Livingston, LA, USA 


2000 


4000 


30.56°N 


-90.77°E 


108.0° 


198.0° 


AIGO 


Gingin (Perth), AUS 


TBA 


80 


-31.04°N 


115.49°E 


180° 


270° 



The n um erical code th at we have developed for this case first computes g{i){j)ai3 from the moment functionals given 
in Eq. (C£). Then Eq. ( 5.23| ) is used to compute g±2,±2ai3 and, subsequently, the metric gaf3 is obtained from Eq. 
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( ^.25 ) as a function of ni, n^ (analogously, {9, 0}), e and ip. This metric is then projected orthogonal to i5° to obtain 



7ij. Since our goal here is to get estimates (within a factor of 10) of the online computational speed requirements, we 
obtain the parameter-space volume of such a search by integrating ^det || 7 || over the parameters 9, cfi, and ^ for a 
few chosen values of e and ip. From this volume we derive the number of templates, ntot, the computational cost, and 
the online speed needed for this network. 
The parameter-space volume is given by. 



V ~ fs^max / / Vdet II 7 II dnidns . (6.31) 



We have evaluated this integral for several values of e and tp. Its average value turns out to be ~ 250. For most of 
the astrophysically interesting ranges for e and ip, the proper volume does not vary by more than a factor of 3 beyond 
this value. As before, taking fs = 40 Hz, £,rnax ^ 138 sec, p^ifJ, = 0.03) = 125, we find the num ber o f templates to 



be ritot ~ few times x 10®. The computational cost using a template bank is obtained from Eq. ( 5.3C ). This cost is 
essentially given by the search over the time-delay window and, hence, 

Cb =^ 2mntot ■ (6.32) 

If we take a data train 5000 sec. long corresponding to £,(i)max = 1588 sec. for VIRGO, we have N ~ 10^, and the 
computing cost is Cb ^ 10^^ fl-pt ops. For online processing, this data must be processed in about 3412 sec, yielding 
an online speed requirement of about few tens of Tflops. 

D. Numerical results 

In Table ||, we summarize the numerical results for various networks. We list the total number of templates, ritot, 
required for a search over ^, 9, and (j) and also the break up into rig and n^. In the case of a two-detector network, 
n§ corresponds to a 1-dimensional grid in 9. We also give the corresponding values for «§. Finally in the last two 
columns we provide the online computational speeds where we have taken the data train to be of 500 sec. duration 
and the zero-padding is 500 — ^max = 362 sec. long, except for the LV network. In the LV case, the duration of 
the data train is 5000 sec, and the zero-padding is 3412 sec. The computational speeds Sb and Ss are obtained by 
dividing the computational costs by the duration of the padding. 



Network 


ritot 


ni 


ng 


ng 


Cb 


Cs 


Sb 


Ss 


configuration 










(fl-pt ops) 


(fl-pt ops) 


(Gflops) 


(Gflops) 


/ 


8.9 X 10'' 


4.4 X lO-" 


- 


- 


5.3 X 10'" 


- 


1.5 


- 


II -a 


1.6 X lO'' 


6.2 X 10^ 


129 


170 


2.7 X 10" 


2.5 X 10^^ 


75 


69 


II -b 


7.7 X 10^ 


6.2 X 10^ 


62 


170 


1.4 X 10" 


2.5 X 10^=* 


39 


69 


LH 


1.9 X 10^ 


6.2 X 10^ 


15 


40 


4.5 X 10"" 


6.9 X 10^^ 


12 


19 


LXv 


5.3 X 10^5 


6.2 X 10^ 


43 


105 


1.0 X 10*^ 


1.6 X 10^=* 


28 


44 


LXt 


6.2 X 10= 


6.2 X 10^ 


50 


128 


1.1 X 10" 


1.9 X 10^=* 


33 


52 


LXa 


8.0 X 10^ 


6.2 X 10^ 


65 


166 


1.4 X 10" 


2.4 X 10^=* 


39 


69 


LV 


3.5 X lO'' 


1.9 X 10* 


92 


105 


6.1 X 10" 


- 


170 


- 


III 


3.6 X 10** 


7.6 X 10^ 


2.4 X lO"* 


1.5 X lO** 


8.6 X 10'^ 


3.2 X 10^^ 


2.3 X lO** 


8.8 X 10^ 



TABLE II. The table lists number of templates, computational costs, and online computing speeds required for a search 
using specific networks. The detector networks are labeled as I for a single detector, II — a for two identical detectors with 
identical orientations located diametrically opposite on the surface of the earth, II — b for two identical detectors with arbitrary 
orientations located diametrically opposite on the surface of the earth, and III for three identical detectors with identical 
orientations placed on Earth's equator forming an equilateral triangle. The detector Xo denotes a detector with LIGO-I noise 
at the location of the detector D. The letters L, H, V, T, and A denote the detectors, LIGO detector at Louisiana, LIGO 
detector at Hanford (the one with arms of length 4 km each), VIRGO, TAMA and AIGO sites, respectively. We assume LIGO-I 
noise for both the LIGO detectors. We take the lower limit on the masses to be O.5Af0, i.e., Mi and M2 each > 0.5Mq. Thus, 
we have ^max ~ 138 sec. for fs = 40 Hz, except for the LV case (where fs = 16 Hz). We consider data trains of 500 sec. 
sampled at 2 kHz so that A'' ~ 10^, n| — 4.4 x 10^. For the LV network, £,(i),nax ~ 1588 sec. and, therefore, for a data train 
of length 5000 sec, one finds N ~ 10^. 
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VII. FALSE ALARMS, DETECTION PROBABILITIES, AND VETOES 
A. False alarm and detection probabilities 

The maximum-likelihood method involves computing the likelihood ratio for given data and comparing it with a 
predetermined threshold. In some cases it is more useful to replace the likelihood ratio by another quantity derived 
from it. When the likelihood ratio, or the LLR, is a function of several parameters, it is often possible to maximize it 
analytically over some of these parameters, as we have shown here. In our case, such a maximization led to a reduced 
statistic derived from the LLR. We call this statistic A. To detect the presence of a signal in the data one must, 
therefore, compute the values that A takes at all the grid points on the space of the remaining parameters. Each of 
these values must then be compared with the threshold, Aq, to infer the presence or absence of a signal. The value of 
Aq can be obtained via the Neyman-Pearson decision criterion |3|] , given the predetermined value of the false-alarm 
probability, Qq, associated with the event of detection of the signal. When A < Aq, we conclude that the signal is 
absent in the data, whereas when A > Aq, the detection of the signal is announced. 

To compute the false-alarm probability and the detection probability, Qd, we need to know the probability distri- 
bution of A in the absence of the signal, i.e., po{A), and in the presence of the signal, i.e., pi{A). We note that A is 
a sum of squares of the random variables Cq , cJJT/j, Cq , and c~,2. If our assumed properties of the detector noises are 

valid, then in the absence of a signal, i.e., for hypothesis Hq, each of the random variables Cq , c^/2, Cq , and c^i2 has 
a mean equal to zero. To see this, let 

C = co + ic^/2 , (7.1) 

where 

Co = {c'q} and c^/2 = {4/2} • (7.2) 

Further, define 

cj = v=^ • Co and c^^^ = v=^ • c^/2 . (7.3) 

Then, from Eq. ( [I.^] ) it follows that Cq, c^^/2 and, therefore, Cg and c^/2, each has a vanishing mean. From the 
assumed independence of noise among the different detectors and the orthonormality between v+ and v^, and also 
between s^ and s^/2, we obtain the covariances between Cq and c^,^ as well as the covariances between Cq and c^,^ 

to be zero. On the other hand, the variances of each of Cq and c^,^ is unity. Thus, under the Hq hypothesis, A 
is the sum of squares of the independent Gaussian random variables [see (I.19|)] with mean zero and unit variance. 



We conclude from standard literature (see, e.g., Ref. |2^]) that the probability density function for A, under the Hq 
hypothesis, is a x^ distribution with four degrees of freedom, and is given by 

Po(A) = ^exp(A/2). (7.4) 

The false alarm probability Qo is then obtained to be 

Qo = y °° PoiA)dA = (l + y) e^P(-Ao/2) • (7-5) 

The value of Qq, which is inferred from astrophysical estimates of event rates and detector sensitivities, determines 
the detection threshold Aq through the above equation. 

The detection probability is obtained from the probability distribution pi (A) . In order to calculate pi (A) , we need 
the norm of the average network correlation- vector when the template gives a perfect match with the data. Assuming 
that the strength of the signal in the data is b, i.e., if x^ (t) — bs^ (t) + n-' (t), then the average value of the network 
correlation- vector is C = bQe~"''= . Therefore, 

\\Cf -c2+4/2 

= ~o+^o+~'/2+^'/2 = b' , (7.6) 

for which we obtain (see, e.g., Ref. pTj) 
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Pi (A) 




exp 



A- 



/i(bVA) 



where /i is the modified Bessel function. The detection probability itself is 



Qd 



Pi{A)dA 



(7.7) 



(7.^ 



which we now obtain in the large SN R li mit. In terms of the network statistic, L = vA, this asymptotic limit amounts 
to the condition bL >> 1, and Eq. ( JZ.TJ ) approximates to a Gaussian distribution: 



Pi(L) 



/27r 



cxp 



(L-hY 



(7.9) 



Thus, in the large SNR limit the network statistic is a Gaussian with mean approximately equal to the network 
strength of the signal, b. For the networks considered in Sec. Vl, we summarize in Table HI detection thresholds, 
the resolution achievable in the direction to the source, and the relative sensitivity of the network compared to that 
of a single detector. We take the false-alarm rate to be one per year and the detection probability to be 95%. Then, 
assuming a sampling rate of about 2 kHz, we arrive at a false-alarm probability of Qq ~ 1.5 x 10~^^ /ritot- For the 
sake of this calculation, we assumed that output samples in the correlation vector are uncorrelated. The correlation 
between these samples will reduce Qq but this does not make appreciable difference to the thresholds and sensitivities 
Q. The threshold Aq is then computed using Eq. (7.5). 
We define the sensitivity of an M-detector network relative to that of a single detector to be equal to 

\l^i^i gfjJgf^JhQ(^i-^ + AL)/(Lo(M) + ^^), where Lo(m) is the threshold corresponding to a network of M detectors 
(therefore, Eo(i) is the threshold for a single detector) and AL is the solution of 



1 



/2tt J^ 



aL 



e^^'/^dx = Q, 



0.95. 



(7.10) 



It yields AL « 1.64. The sensitivity (which is > 1) is roughly proportional to the average distance at which one can 
detect a source of a given ^ with a network of M detectors as compared to a single detector, with 95% detection 
probability, where the average is taken over all directions and orientations of the binary. For a network of detectors 

with identical orientations the result is exact. 

The resolution in the direction to the binary is obtained from Eq. ( |3.26 ) . It is obtained by noting that the error in 
the 71,3 ^ ni plane is given by 



-2^,^22,^33 



(^23)2 



sm a cos cp crji 



(7.11) 



where an is the resolution given in terms of the angles 9 and cj). We take SNR ^ 12, sufficiently above the threshold to 
guarantee a good detection probability. We compute the source-direction resolutions ag and a^ in the case of two- and 
three- det ector networks, respectively. The an is obtained from the covariance matrix using the above equations. In 
Table III , we give the values for source-direction resolution for a direction normal to the plane in which the detectors 
lie. The reason we choose this direction is because in this direction we expect the resolution to be high. The big 
difference in the values of an between case /// and LHV is because /// is a degenerate case of a network of identical 
detectors, which are merely spatially separated. 



B. Vetoing non-Gaussian events in detector noises 



The assumptions of Gaussianity and stationarity of noise in detectors is an idealistic one. The noise in actual 
detectors will not, in general, satisfy these assumptions, but will rather contain a non-Gaussian component arising 
from causes such as sudden strain releases in mechanical structures, ringing from electronic servo loops, etc. Their 
deviations from Gaussianity are poorly understood and are difficult to model. But, since such noise components usually 
have high amplitude, they can trigger the statistic L to register a "detection" within the scope of the methods described 
so far. This is where a x^-type test described in Ref. p?7| can be used to discriminate against such contingencies by 

~ 2 

using the specific spectral profile of a chirp (|<S'(/)| oc f~'^^^), which is different from that of a non-Gaussian event, 
in general. We briefly describe the test below. 
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TABLE III. Total number of templates, false-alarm probabilities, detection thresholds, relative sensitivities, and the resolu- 
tions in the direction to the source for various network configurations. We have gfu/gm ~ 1-58, where the subscripts 1 and 2 
correspond to VIRGO and LIGO, respectively. The <;(/) are needed to compute network sensitivities relative to that of a single 
detector with LIGO-I noise. 



Network 


ntot 


configuration 




/ 


8.9 X 10^ 


II -a 


1.6 X 10^ 


II ~b 


7.7 X 10^5 


LH 


1.9 X lO'^ 


LXv 


5.3 X lO'^ 


LXt 


6.2 X 10^5 


LXa 


8.0 X lO'' 


LV 


3.5 X 10^ 


III 


3.6 X 10* 


LHV 


3.5 X 10* 



Qo 



-'O(M) 



1.7 X 10"''' 
9.4 X 10"^* 
1.9 X 10~'^ 
7.9 X 10"^^ 

2.8 X 10~^^ 
2.4 X 10"'^ 

1.9 X lO"^'' 
4.3 X 10"^* 

4.2 X 10"^° 

4.3 X 10"^° 



Relative 


sensitivities 


1 


1.3 


1.3 


1.4 


1.4 


1.3 


1.3 


1.5 


1.6 


1.7 



ere 
or an 



8.7 
9.3 
9.2 
9.0 
9.2 
9.2 
9.2 
9.4 
9.9 
9.9 



0.3° 

0.6° 

2.5° 

0.9° 

0.8° 

0.6° 

0.7° 
0.15 sq.deg. 
1.2 sq.deg. 



The frequency bandwidth in each detector, from / = to / = /Nyquist, is divided into p-subintervals in the following 
way. Let, 

'fk(i) 



{x\y^)(i)k 



25R 



-df 



(7.12) 



Sh{I){I) 

where in the integrand the index / is not summed over. Using the above definition, one partitions the interval 
[0, /Nyquist] by setting (s^,So)(/)fc = i^i/2^^i/2){i)k = l/(2p)- This way, for each detector, /, we get a partition 
0, /i(/)i /2(/)j •••7 fp{i) — /Nyquist- Ncxt, One computes the following correlation over each subinterval k: 

Clfc^(5^x^)(,)fe. (7.13) 

The individual detector-correlations can, therefore, be expressed as 



C' 



p 

fe=i 



Define the deviation in Cj, from the average contribution to C^ from the k-th frequency bin to be 

ACi EE Ci - C'/p , 
which, by definition, obeys X]fc=i ^^l — 0- Then the x^ statistic is given by 



X(/) 



fc=l 



/|2 

k\ 



(7.14) 



(7.15) 



(7.16) 



which has 2p — 2 degrees of freedom. If the detector / is behaving "properly" , that is, if the detector output is mainly 
the Gaussian noise, with or without a chirp, then xfj) has a small value. On the other hand, a relatively large value of 
xfj) is taken to indicate non-Gaussianity. Choosing p = 20, as in ReL Q, we see that xfj) has 38 degrees of freedom. 
Defining xt to be the decision threshold, if xfi) > X*i then we reject the hypothesis that the event is a signal, else we 
accept that there is a signal present. For 38 degrees of freedom at 90 percent confidence level, one finds xl ^ 50. 

We apply the test to the network in the following way. Suppose, the statistic L exceeds the threshold Lq for some 
time lag t — tq. After accounting for the relative time delays, we compute the C/j.-, at tq and construct xfi)! for ^U 
/. Ncxt, we test whether each detector satisfies the assumption of Gaussianity and stationarity by comparing xfj) 
with xi ■ If xfi) < X* : for any J, then we accept the decision that we have actually detected a signal. On the other 
hand, if for some J = /i,72,/3, ...,Imi, xfj) > X*i ^^ ignore the contribution from these detectors in computing L 
and construct L' for the rest of the data from M — Mi detectors. Now, if L' crosses the threshold, then we say that 
the signal is detected, otherwise it is not. If non-Gaussian events occur relatively rarely, then it is unlikely that more 
then one detector simultaneously will have such events. Then Mi on most occasions will be unity. This concludes a 
simple generalization of the vetoing used for single detectors to the case of a networks. 
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VIII. CONCLUDING REMARKS 

We have presented here a data-analysis strategy based on the maximum-hkehhood method for the detection of GW 
signals from inspiraling compact binary stars with a network of laser interferometric detectors. Our approach is based 
on a coherent search of the data from all the detectors in a network and, therefore, is inherently optimal. The formalism 
described is mathematically elegant and simple. In Gaussian noise, the method is tantamount to matched filtering 
the signal. However, the noise model we consider here is more realistic in that it allows for occasional non-Gaussian 
bursts superposed on a predominantly Gaussian noise background. Sections of data that contain non-Gaussian bursts 
are vetoed out by a x^ criterion. For simplicity, we consider the Newtonian inspiral waveform, but it is clear that our 
formalism is as well applicable to waveforms depending on a larger number of intrinsic parameters, such as spins of 
the binary members. In particular, the formalism is extendable to the restricted 2.5 PN inspiral waveform. In that 
event the number of network templates required increases essentially by the same factor as in the case of a single 
detector: Assuming LIGO-I noise in the detectors and individual stellar masses of 0.5Mq or larger, the increase in 
the number of templates is by a factor of about 4 to 5, when the maximum allowed mismatch is 3%. One would then 
expect the computational cost to increase by a similar factor. We expect to look into this issue in greater detail in 
the future. 

For the Newtonian case, the online computational speed requirements are high - from tens of Gflops for a network 
of two detectors to a few tens of Tflops for a network of three widely separated detectors around the globe. Clearly, 
efficient signal extraction methods are called for. A hierarchical search approach to this problem should, therefore, be 
explored. Alternatively, the search can be restricted to selected regions in parameter space, the selection of regions 
being based on prior astrophysical information. For example, the search may be restricted to the masses of the stars 
being greater than a solar mass, the argument being that it is unlikely to find compact objects of a smaller mass. 
This would reduce the computational cost by a factor of 4 in the Newtonian case. 

The relative sensitivity of a network, on an average, increases by a factor little less than \/M, where M is the 
number of detectors having identical noise PSDs. Although the signal energy accessible to the network on an average 
(with the average taken over all the directions and the orientations of the binary) increases by a factor of M, the 
change in the threshold value of the network statistic, which obeys the Rayleigh distribution in the absence of the 
signal, is such that the overall factor of increase in sensitivity is a little less than v^M. In the case of detectors with 
different noise curves, the quantity g/j\ plays a central role. In such a case, the relative sensitivity of a network on 



an average increases as \/J2i=i9(i)/9(i) where the sensitivity is normalized to detector 1. Thus, a detector with a 
larger value of gtj\ influences the relative sensitivity that much more. 

We also estimate the errors made in determining the direction to the source by computing the covariance matrix 
which is just the inverse of the metric obtained in the parameter space divided by the square of the SNR. We find 
that for a network of detectors spread around the globe, with all detectors having LIGO-I noise curve and an SNR of 
12, the resolution is about a fraction of a degree. 

Our analysis essentially assumes Gaussian noise (with occasional non-Gaussian bursts). The fact that real detectors 
produce non-Gaussian and non-stationary noise makes this issue highly relevant. This issue must be addressed more 
thoroughly in the future. Since the signals are generally weak in nature, it is desirable that the search strategy be 
optimal. If a simple enough mathematical model that adequately describes the noise in the real detectors can be 
given, then our approach based on the maximum likelihood method can still be explored. Creighton has already 
investigated this approach where the model for noise contains Poisson-distributed bursts superposed on the usual 
Gaussian component g^. Such an approach seems promising and could be investigated further. Another approach 
that is simple, but suboptimal is that of matching event lists in each detector and putting thresholds on estimated 
source parameter differences. 

These issues for more realistic noises and signals still remain to be addressed. Here our main goal in this work is 
to provide a general framework based on the method of maximum likelihood, which uses a coherent approach and is 
therefore optimal. Also many of the results we obtain here may be scaled up in a straightforward way to obtain order 
of magnitude estimates in more general situations. The extension to the PN waveform is just one such case. 
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APPENDIX A: STF TENSORS AND GEL'FAND FUNCTIONS: A REPRESENTATION OF SO{3) 

To understand the relation between the responses of two different detectors in a network to the same incoming chirp, 
it is useful to study the behavior of the detector and wave tensors under three-dimensional orthogonal transformations. 
This is tantamount to developing an understanding of STF tensors (of rank 2) under the action of an element of the 
rotation group S0(3), g{a,f},^), where (a,/3, 7) are the Euler angles. Since we extensively deal with STF tensors of 
rank 2 in the text, enunciating some frequently used properties of such objects is in order .[j 

Any STF tensor of rank I can be expanded in a location-independent basis of "STF-/" tensors, y^^^, which has a 
dimension of 2/ + 1. STF-Z tensors, with rank I — 2 are related to the spherical harmonics as follows: 



Y2„^{0, 0) = y'Ln^nJ , where m = ±2, ±1, , (Al) 

where n = (cos sin 0, sin sin 6*, cos 6*). The "STF-2" tensors defined above are also called spin- weighted spherical 
harmonics. They obey the following normalization relation 



3^2^„3^if* = ^C • (A2) 



When one makes a passive orthogonal transformation of frames through the Euler angles {a, /S, 7}, the angles {9, cj)} 
get relabeled to, say, {0' , (j)'}. Then, the spherical harmonics in the new frame can be expanded in terms of those in 
the old frame as, 

>^2m(^',0')-r„"(a,/3,7)>^2„(e,0) , (A3) 

where the right-hand side has an implicit summation over n — 0, ±1, ±2. Above, the expansion coefficients, T^", are 
the Gel'fand functions of rank 2. 

The group composition law of two elements of the rotation group, say, gi{(j),9,^) and g2{a^[3^^) leads to the 
following addition theorem for the Gel'fand functions: 

r™"(0',0',V') = r„/(0,0,^)r,"(a,/3,7) , (A4) 

where once again the summation over s = 0, ±1, ±2 on the right-hand side is understood. The transformation of 
STF tensors under rotation is governed by the above theorem for Gel'fand functions. 

APPENDIX B: GW POLARIZATION TENSORS AND THE DETECTOR TENSOR 

The detector tensor for an interferometer is defined as 

dij = sm2& [nunij - n2in2j) , (Bl) 

where iii and n2 are the unit vectors along the arms of the interferometer and 20 is the opening angle, i.e., the angle 
between its two arms |^]. Here we shall take the detectors to have orthogonal arms. In that event 

ni = ^(1,-1,0), n2 = -^(1,1,0) , (B2) 

in the detector frame. When referred to the fide frame, however, ni_2 depend on the Euler angles, {a,/3,7}, that give 
the orientation of the detector relative to the fide. The dependence of dij on these angles can be expressed in a neat 



form by realizing that it is a second-rank STF tensor, as is evident from Eq. (Bl). Thus, it can be expanded in a 



basis of STF-2 tensors. It can be shown that the expansion coefficients in such a basis are Q 

dijyt = -^\[^{T^*n{o^.Pn)-T-^*n{a,Pn)) , (B3) 

in the fide frame. Above, n = 0, ±1, ±2, and the Gel'fand functions depend on {a,/3,7}. 



For a detailed discussion, see Refs. Pq,B6[. For a more selective reading of immediate relevance, we refer to Ref. 

36 



Similarly, the corresponding components of dij can be deduced in the wave frame. Apart from depending on the 
angles {a,/3, 7}, these coefficients will also depend on the orientation of the wave frame relative to the fide, given by 
{(p,9,tjj}. Using the addition theorem for Gel'fand functions, these components are 



d^,yz 



1.3 ■^2n 



V Stt 



,V')(r2%(a,/?,7)-r-2%(«,/3,7)) 



-Dr, 



(B4) 



in the wave frame. The extended beam-pattern function (3.16) depends on the coefficients, D„, with n = ±2 and 
^- = 0. 



APPENDIX C: NOISE CURVES AND NOISE MOMENTS 



We define the moments of the /-th detector's noise curve as. 



«(/)(9) = Sh(i){fo(i)] 



fail)/ 1^(1) 



dx 



-9/3 



Sh(I){xfs(I)) 



(CI) 



where /o(/) denotes the "knee" frequency of that detector; this is the frequency at which the sensitivity of the detector 
is the highest. On the other hand, fc{i) is its high-frequency cut-off and /s(/) is the seismic cut-off. The noise moment, 
iij\{7), is related to the normalization, gu), as follows: 



8 



4/3. 



2 '-' — */0- /-.N 

3(1) = ^ 77 \S(i) Hi)\') ' 



(C2) 



where qii\ = fs(i)/fs- Since our templates are normalized using the above factor, we find the following ratio useful 
in our calculations of the parameter-space metric: 



j{i){q) = *(/)(<?)/*(/) (7)- 



(C3) 



In this paper, we evaluate these noise moments using the analytical fits to noise PSDs of different detectors given in 
Ref. [|l). These fits are presented in Table 

TABLE IV. Analytical fits to noise PSDs, Shif), of the interferometric detectors studied in this paper. Here sq denotes the 
minimum value of Sh{f), and /o is the frequency at which this minimum occurs. We take Sh{f) to be infinite below the seismic 
cut-off frequency fs- We choose the high frequency cut-off, fc{i), to be 800 Hz for all /. 



Detector 



Fit to noise PSD 



so/32 |(/o//)' + 13(/o//) +9[1 + if/fof]} 

so/5 [4(/o//)^/^-2 + 3(///o)2] 

so/4 [290(/.//)5 + 2(/o//) + 1 + if/fof] 

so/3 [ifo/fr + 2(///o)'] 



so (H2 



-l^ 



2.4 X 10""'' 
6.6 X 10"""^ 
1.1 X lO""*^ 
8.0 X lO""*^ 



/o (Hz) 



/. (Hz) 



TAMA-300 
GEO600 
VIRGO 
LIGO I 



400 
210 

475 
175 



75 
40 
16 
40 



There are certain combinations of the noise moments that appear frequently in the expression for the metric on the 
parameter space relevant for a network (see Sec. VI). In order to simplify these expressions, we define the following 
noise-moment combinations: 



ki{i) = 



«2(/) 



J(/)(1)-Jf/)(4)J, 
i(/) (17) -jf/) (12)1/25, 



fc3(7)=3[j-(/)(9)-J'(/)(4)j(/)(12)]/5 , 



(C4) 



which are, in general, different for detectors with different noise P SDs. 

The moment functionals {^a){i) and ($c$^)(7) defined in Eq. ( 5.19 ) for the /-th detector, can be given in terms 
of the moments of its noise curve. They are (for ri2 7^ 0) 
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($o)(/) =27r£.(j) j(/)(4) , 
(*i)(/)-6^g-,f j(,)(12)/5 , 
($2,3>(/) = 27r [r(/)3,i - r(j)2n3,i/n2] £»(/) J(/)(4) 

= [''(/)3,l - ?'(/)2"-3,l/"2] ("I'o)(7) , 

((<i>o)')(/)=4^'e(/)J(/)(l) ' 

($0*2,3) (/) = 47r^ [»^(/)3,l - ''(/)2?l34/'^2] fi*^/) j(7)(l) 
= [''(/)3,1 - ?'(/)2"-3,l/"2] (($o)^)(/) , 

((<i>i)')(/)-36^2g-,;°/^j(,)(17)/25 , 

($1$2,3)(/) = 127r^ [''(/)3,l - ''(/)2"3,l/"2] e^/) J(/)(9)/5 
== [^(/)3,1 - ''(/)2"-3,l/"-2] (*0$1>(/) , 

(($2,3)^) (/) = 47r^g^^) [''(/)3,1 - ?'(/)2"3,l/'^2] i(/)(l) 

r 1 2 9 

= ^(7)34 - ?'(/)2n3,l/"-2j ((*o) >(/) , 

($2*3) (/) = 47r2£)(^) [r(/)3 - r(/)2n3/n2] [r(/)i - r(/)2ni/n2] j(/)(l) 

= [''(/)3 - r(i)2n^/n2] [r{i)i - rj^i/riz] (($o)^)(/) , (C5) 

which shows that all the moment functionals are expressible in terms of five independent noise-moments, j{i){q)- 
These are the ones corresponding to g = 1, 4, 9, 12, 17. The values of these noise moments and the combinations 
(C4) for relevant noise PSDs are listed in Table M. Alternatively, all the moment functionals are determined by five 



basic ones, namely, ($o)(7), (*3)(7), (*o)(/)> (*o*3)(/), and ($i)(/) 

For a network of three or less detectors a plane can always be arranged to contain the hubs of all the detectors. 
This makes it possible to choose the fide frame (or the network frame, in this case) in such a way that r(7)2 = for 
all /. With this choice the moment functionals reduce to 

($o)(/) =27rg(j) j(7)(4) , 
($i)(,)-6^e-f j-(,)(12)/5 , 

(*2,3)(/) =''(/)34(*0>(/) , 
(($0)')(/)=4^2£<^^)J(,)(l) , 

(<Do$i)(/) = 12^2^-,5/^^-(,)(9)/5 , 

($0*2,3)(/) =r(/)34((*o)^)(/) , 

(($i)^)(,)=36^2g-;°/^(,)(17)/25 , 

($1*2,3)(/) =r-(/)3,l(*0$l)(/) , 
(($2,3)')(/)=rf,)3^l(($0)')(/) , 
(^2$3)(/) = r-(/)3r(/)i(($o)^)(7) • (C6) 

The moment functionals simplify significantly in this case. 

APPENDIX D: A NETWORK OF TWO IDENTICAL DETECTORS WITH THE SAME NOISE PSD BUT 

DIFFERENT ORIENTATIONS. 

Consider a network of two identical detectors having orientations {ck(i), P[i), and 7(i)} and {a(2), P{2)7 and 7(2)}: 
respectively, which we take to be different. Then the beam-pattern functions of each detector are dependent on 
{»(/), /3(/), 7(7): Cj "01 ^5 '/'Ij where 7 = 1,2. Here, we prove that for a given set of values for the detector orientations and 
source-direction, {9, </>), the function |(5^||Q^| can always attain the maximum value of 1/2, with jQ^j = \Q^\ — l/-\/2. 
This proof is assumed in obtaining the result in Sec. |ll|. 
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TABLE V. Noise moments of some of the planned interferometric detectors. In evaluating these, we take the values of /^(j) 
and fo(i) as given in Table vA The upper cut-off frequency, /c( j) , is assumed to be 800 Hz for all detectors. 



Noise moments 


j(l) 


j(4) 


j(7) 


j(9) 


j(12) 


j(17) 


fci 


fc2 


fc3 


LIGO-I 


21.3 


4.089 




0.444 


0.157 


0.045 


4.572 


0.007 


-0.1197 


VIRGO 


132.4 


7.774 




0.407 


0.167 


0.068 


71.99 


0.0145 


-0.5347 


GEO-600 


17.99 


3.49 




0.537 


0.273 


0.136 


5.809 


0.0222 


-0.2493 


TAMA 


20.94 


4.111 




0.443 


0.133 


0.045 


4.039 


0.0099 


-0.0625 


WHITE NOISE 


12.97 


2.574 




0.677 


0.453 


0.291 


6.351 


0.031 


-0.293 



Proof:- 

The network vector Q lies in the helicity plane H. Therefore, 



Q = 



-2q^ , n+2r 



O+'D 



+2 



(Dl) 



where Q+2 = D+2 • Q and Q-2 = D_2 • Q- Alternatively, for a two-detector network we can expand Q in the real basis 
of /'s on the network space. 



Q^QH + Q^2 



with Qi = 1 • Q and Q2 = 2 ■ Q. Using Eqs. (Dl) and (D2), wc find Qi and Q2 in terms of Q and Q^ as 



Q2 = iQ-^D^22 + Q^^D+2^)/ W D 



where £'±21 = 1 • D±2 and £'±22 — 2 • D±2- Thus, 



Q-^D^2i+Q+^D 



+21 



Q-^D^22+Q+^D^ 



22 



(D2) 



(D3) 



(D4) 



Let us assume that Qi differs from Q2 by an overall phase factor e'". Then, since IQ^P + |Q^P = 1 for such a network, 
we have \Qi\ = \Q2\ — l/\/2- We shall now prove that for a given set of values for the detector orientations, source- 
direction, and Lo, one can always find e and ip, wit hin their physically allowed range, i.e., e £ [0,7r] and ip £ [0, 27r], 
such that our above assumption is met. Equation (D4), therefore, leads to 



Q 



+2 



D. 



21 



^-226" 



D+22e'" - D+2i 
Texp(w) , 



(D5a) 
(D5b) 



where T and v are real numbers. Note that the RHS of ( D5a| ) can take any value in the complex plane. More 



importantly, this is true also of the LHS of (D5a) because Q /Q = T2 (?A, e, 0)/T2 (-0, e, 0) can always take any 



value on the Argand plane for astrophysically relevant ranges of e and ^. Thus, our assumption remains vindicated 
and, hence, one can always choose values for e and '0 that maximize the function IQ1IIQ2I to attain the value of 1/2. 
These values corresponding to the maximum are -0 = —v/i and e — cos^^[(T^/^ — 1)/(T^/^ + 1)]. 
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